Abstract

Let $E$ be an elliptic curve over $\mathbb{Q}$. In 1988, Koblitz conjectured a precise asymptotic for the number of primes $p$ up to $x$ such that the order of the group of points of $E$ over $\mathbb{F}_p$ is prime. This is an analogue of the Hardy and Littlewood twin prime conjecture in the case of elliptic curves.

Koblitz's conjecture is still wide open. In this paper we prove that Koblitz's conjecture is true on average over a two-parameter family of elliptic curves. One of the key ingredients in the proof is a short average distribution result in the style of Barban-Davenport-Halberstam, where the average is taken over twin primes and their differences.

1 Introduction

A well-known open problem in number theory is the twin prime conjecture, which states that there exist infinitely many primes $p$ such that $p + 2$ is also a prime. This conjecture was generalized by Alphonse de Polignac in 1849 to the statement that, for any even integer $r \neq 0$, there exist infinitely many primes $p$ such that $p + r$ is also a prime. In 1922, G.H. Hardy and J. Littlewood made this statement precise, predicting that, as $x \to \infty$,

$$\#\{p \le x : p + r \text{ is prime}\} \sim \mathfrak{S}(r)\,\frac{x}{\log^2 x},$$

where

$$\mathfrak{S}(r) := \begin{cases} 2 \displaystyle\prod_{\ell \neq 2} \frac{\ell(\ell-2)}{(\ell-1)^2} \prod_{\substack{\ell \mid r \\ \ell \neq 2}} \frac{\ell-1}{\ell-2} & \text{if } 2 \mid r, \\ 0 & \text{otherwise.} \end{cases} \tag{1}$$



Here and everywhere in the paper, $p$ and $\ell$ are used to denote primes.
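To make the prediction concrete, the following is a minimal numerical sketch (not part of the original argument). It evaluates a truncated version of the Euler product for $\mathfrak{S}(r)$ and counts prime pairs directly. The function names and the truncation bound `prime_bound` are illustrative assumptions; note also that $x/\log^2 x$ is only the leading-order shape of the prediction, so close agreement at small $x$ should not be expected.

```python
from math import log

def primes_up_to(n):
    """Sieve of Eratosthenes: list of all primes <= n."""
    if n < 2:
        return []
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(n + 1) if sieve[i]]

def singular_series(r, prime_bound=10_000):
    """Truncated Euler product for S(r) (assumes r != 0); exact value needs all primes."""
    if r % 2 != 0:
        return 0.0
    value = 2.0
    for ell in primes_up_to(prime_bound):
        if ell == 2:
            continue
        value *= ell * (ell - 2) / (ell - 1) ** 2
        if r % ell == 0:
            value *= (ell - 1) / (ell - 2)
    return value

def count_prime_pairs(x, r):
    """Number of primes p <= x such that p + r is also prime (r > 0)."""
    prime_set = set(primes_up_to(x + r))
    return sum(1 for p in prime_set if p <= x and p + r in prime_set)

if __name__ == "__main__":
    x = 100_000
    for r in (2, 4, 6, 30):
        predicted = singular_series(r) * x / log(x) ** 2
        print(r, count_prime_pairs(x, r), round(predicted))
```

The factor $(\ell-1)/(\ell-2)$ at primes dividing $r$ is why, for instance, pairs at distance 6 are predicted to be twice as frequent as twin primes.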

Even though still inaccessible by current methods, the twin prime conjecture has generated tremendous advances in number theory. Indeed, in 1919 Viggo Brun [Br] developed what is now known as the Brun sieve to prove the surprising result that

$$\sum_{\substack{p \\ p+2 \text{ prime}}} \frac{1}{p} < \infty.$$

Brun's methods opened the way to sieve theory, leading to upper bounds of the right order of magnitude for the number of twin primes $p \le x$ and to the important achievement of Jingrun Chen [Che] from 1966 that $\#\{p \le x : p + r = P_2\} \gg \frac{x}{\log^2 x}$, where, for an integer $k$, $P_k$ denotes a product of at most $k$ primes. This result relies on another important application of sieve theory, the Bombieri-Vinogradov theorem on averages of primes in an arithmetic progression, obtained independently by E. Bombieri and A.I. Vinogradov in the mid 1960s. In the late 1980s, H. Maier and C. Pomerance, and, subsequently, the first author of this paper, obtained similar Bombieri-Vinogradov type results concerning averages of twin primes (see [MaPo], [Ba]), by building on previous work of N.G. Chudakov, A.F. Lavrik, H.L. Montgomery and R.C. Vaughan.

The twin prime conjecture can be generalized in many directions. For instance, the Hardy-Littlewood heuristics can be used to predict the (same) asymptotic formula for the number of primes $p \le x$ such that $(p-1)/2$ is also a prime. This question may be reformulated as counting the number of primes $p \le x$ such that the group $\mathbb{F}_p^*/\{\pm 1\}$ is of prime order. Such a reformulation may then be easily generalized to other groups, say to the group of points of an elliptic curve: given an elliptic curve $E/\mathbb{Q}$ over the field of rational numbers, count the number of primes $p \le x$ of good reduction for $E$ such that the group $E(\mathbb{F}_p)/E(\mathbb{Q})_{\mathrm{tors}}$ is of prime order, where $E(\mathbb{F}_p)$ denotes the reduction of $E$ modulo $p$ and $E(\mathbb{Q})_{\mathrm{tors}}$ denotes the torsion subgroup of $E/\mathbb{Q}$. This question has theoretical relevance to elliptic curve cryptography and was first considered by Neal Koblitz in 1988:
Koblitz's Conjecture [Ko]

Let $E/\mathbb{Q}$ be an elliptic curve defined over the field of rational numbers. We assume that $E$ is not $\mathbb{Q}$-isogenous to an elliptic curve with non-trivial $\mathbb{Q}$-torsion subgroup. Then there exists a positive constant $C(E)$ such that, as $x \to \infty$,

$$\pi_E^{\mathrm{twin}}(x) := \#\{p \le x : |E(\mathbb{F}_p)| \text{ is prime}\} \sim C(E)\,\frac{x}{\log^2 x}.$$

A candidate for the explicit constant $C(E)$ was given by Koblitz in his paper and was later corrected by D. Zywina [Zy]. It is described in detail in Section 2 in the generic case that $E/\mathbb{Q}$ is without complex multiplication.

It is useful to write the number of points of $E$ over $\mathbb{F}_p$ as

$$|E(\mathbb{F}_p)| = p + 1 - a_p(E),$$

where $a_p(E)$ satisfies the Hasse bound $|a_p(E)| < 2\sqrt{p}$. This also makes the analogy between Koblitz's Conjecture and the twin prime conjecture more apparent. Exploiting this analogy, one can employ sieve methods to find partial results towards Koblitz's Conjecture. This approach was initiated by S.A. Miri and V.K. Murty [MiMu] and further refined by A. Steuding and J. Weng [StWe], the second author [Co], and H. Iwaniec and J. Jiménez Urroz [IwUr]. We currently know upper bounds of the right order of magnitude for $\pi_E^{\mathrm{twin}}(x)$ ([Co]), provided the Generalized Riemann Hypothesis holds if $E/\mathbb{Q}$ is without complex multiplication, and various lower bounds in the style of Chen's result ([MiMu], [StWe], [Co], [IwUr]). Regarding lower bounds, the best result that one may hope to achieve by sieve technology was obtained by [IwUr] for the complex multiplication elliptic curve $E : y^2 = x^3 - x$. They showed that

$$\#\left\{p \le x : p \equiv 1 \pmod 4,\ \tfrac{1}{8}|E(\mathbb{F}_p)| = P_2\right\} \gg \frac{x}{\log^2 x}.$$
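As an illustration of the quantities involved, here is a naive sketch that counts $|E(\mathbb{F}_p)|$ by brute force, checks the Hasse bound, and lists the primes for which the group order is prime. The function names and the sample curve $y^2 = x^3 + x + 1$ used in the demo are our own illustrative choices (no claim is made that this curve satisfies the hypotheses of Koblitz's Conjecture), and the loop is far too slow for serious experiments.

```python
def legendre(v, p):
    """Legendre symbol (v/p) for an odd prime p, via Euler's criterion."""
    v %= p
    if v == 0:
        return 0
    return 1 if pow(v, (p - 1) // 2, p) == 1 else -1

def count_points(a, b, p):
    """|E(F_p)| for E: y^2 = x^3 + a*x + b, p an odd prime of good reduction."""
    total = 1  # the point at infinity
    for x in range(p):
        total += 1 + legendre(x * x * x + a * x + b, p)
    return total

def is_prime(n):
    """Trial division; adequate for the small values used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

def koblitz_count(a, b, x):
    """Primes 5 <= p <= x of good reduction such that |E(F_p)| is prime."""
    disc = -16 * (4 * a ** 3 + 27 * b ** 2)
    hits = []
    for p in range(5, x + 1):
        if not is_prime(p) or disc % p == 0:
            continue
        order = count_points(a, b, p)
        trace = p + 1 - order
        assert trace * trace < 4 * p  # Hasse bound |a_p(E)| < 2*sqrt(p)
        if is_prime(order):
            hits.append(p)
    return hits

if __name__ == "__main__":
    print(koblitz_count(1, 1, 200))   # illustrative sample curve
    print(koblitz_count(-1, 0, 200))  # y^2 = x^3 - x
```

For $y^2 = x^3 - x$ the list is empty, since the three points with $y = 0$ force $4 \mid |E(\mathbb{F}_p)|$; this is the reason the result quoted above normalizes by $\tfrac{1}{8}$.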

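The main purpose of this paper is to prove the validity of Koblitz's Conjecture on average over a set of elliptic curves.

Theorem 1 Let $x > 0$ be a variable and let $A = A(x)$, $B = B(x)$ be parameters such that $A, B > x^{1/2+\varepsilon}$ and $AB > x^{3/2+\varepsilon}$ for any fixed $\varepsilon > 0$. Let $\mathcal{C}$ be the set of elliptic curves $E(a,b) : Y^2 = X^3 + aX + b$, where $a, b \in \mathbb{Z}$ with $|a| \le A$, $|b| \le B$. Then, as $x \to \infty$,

$$\frac{1}{|\mathcal{C}|} \sum_{E \in \mathcal{C}} \pi_E^{\mathrm{twin}}(x) \sim \mathfrak{C}\,\frac{x}{\log^2 x},$$

where $\mathfrak{C}$ is the non-zero constant

$$\mathfrak{C} := \frac{2}{3} \prod_{\ell \neq 2} \frac{\ell^4 - 2\ell^3 - \ell^2 + 3\ell}{(\ell-1)^3(\ell+1)} = \prod_{\ell} \left(1 - \frac{\ell^2 - \ell - 1}{(\ell-1)^3(\ell+1)}\right).$$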

As will be shown in Section 2, the average constant $\mathfrak{C}$ gives further evidence for the conjectural constant of Koblitz's Conjecture.

The first steps in the proof of Theorem 1 follow the ones in the proof of the average Lang-Trotter Conjecture on Frobenius traces obtained by the third author and F. Pappalardi in [DaPa], with an improvement due to S. Baier [Bai] to shorten the average in terms of $A$ and $B$ (in [DaPa], one needed to take $A, B > x^{1+\varepsilon}$). More precisely, we first reduce the average of Koblitz's Conjecture given in Theorem 1 to an average involving only elliptic curves over the finite field $\mathbb{F}_p$; we then use Deuring's Theorem to rewrite this as an average of Kronecker class numbers. These steps are described in detail in Section 3.
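The two expressions for the average constant can be compared numerically. The sketch below (our own illustration; the function names and the truncation bound are assumptions) checks with exact rational arithmetic that the local factors of the two products agree prime by prime, and then evaluates a truncated product in floating point.

```python
from fractions import Fraction

def primes_up_to(n):
    """Sieve of Eratosthenes: list of all primes <= n."""
    if n < 2:
        return []
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(n + 1) if sieve[i]]

def local_factor(ell):
    """1 - (l^2 - l - 1) / ((l - 1)^3 (l + 1)), as an exact fraction."""
    return 1 - Fraction(ell ** 2 - ell - 1, (ell - 1) ** 3 * (ell + 1))

def local_factor_alt(ell):
    """(l^4 - 2 l^3 - l^2 + 3 l) / ((l - 1)^3 (l + 1)), as an exact fraction."""
    return Fraction(ell ** 4 - 2 * ell ** 3 - ell ** 2 + 3 * ell,
                    (ell - 1) ** 3 * (ell + 1))

def average_constant(prime_bound=100_000):
    """Truncated Euler product; the tail beyond prime_bound is ignored."""
    value = 1.0
    for ell in primes_up_to(prime_bound):
        value *= float(local_factor(ell))
    return value

if __name__ == "__main__":
    print(local_factor(2))       # the factor 2/3 at l = 2
    print(average_constant())    # truncated numerical value of the constant
```

Since each local factor is $1 - O(\ell^{-2})$, the product converges absolutely and the truncation error is small.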

As corollaries, we obtain the average result of Theorem 1 as well as:

Theorem 2 Let

$$\pi^*(p) := \#\{E/\mathbb{F}_p : |E(\mathbb{F}_p)| \text{ is prime}\}.$$

Then, as $x \to \infty$,

$$\sum_{p \le x} \pi^*(p) \sim \frac{\mathfrak{C}\,x^3}{3 \log^2 x},$$

where $\mathfrak{C}$ is the constant of Theorem 1.

One remarks that Theorem 2 is only concerned with the distribution of elliptic curves over $\mathbb{F}_p$ having a prime number of points. More properties of this distribution could be obtained by considering the higher moments

$$M_k(x) := \sum_{p \le x} \left(\pi^*(p)\right)^k \quad \text{for } k \ge 1.$$

One of the key ingredients in the proof of Theorems 1 and 2 is an average of the standard twin prime conjecture. Such averages were first considered by N.G. Chudakov [Chu] and A.F. Lavrik [Lav], and then, among others, by A. Balog [Ba], who added distribution in residue classes, and by A. Perelli and J. Pintz [PePi], who shortened the average. The length of the average needed for our application is dictated by Hasse's bound and is short ($\sqrt{x}$ compared to $x$); additionally, we also need distribution in residue classes. Such a mixture of additional features is not in the literature and is proven in our paper:

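Theorem 3 Let $x > 0$ and let $\varepsilon, M > 0$. Then there exists an integer $N(M) > 0$ such that, for any $x^{1/3+\varepsilon} \le R \le x$, $N > N(M)$, $Q \le x \log^{-N} x$, and $X, Y$ satisfying $X + Y \le x$, we have

$$\sum_{0 < |r| \le R} \sum_{q \le Q} \sum_{a \,(\mathrm{mod}\, q)} \Bigg| \sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q) \\ p - p' = r}} \log p \cdot \log p' - \mathfrak{S}(r,q,a)\,Y \Bigg|^2 \ll \frac{R x^2}{\log^M x},$$

where

$$\mathfrak{S}(r,q,a) := \begin{cases} \dfrac{\mathfrak{S}(rq)}{\varphi(q)} & \text{if } 2 \mid r,\ (a,q) = (a-r,q) = 1, \\ 0 & \text{otherwise,} \end{cases}$$

and $\mathfrak{S}(rq)$ is as in (1) and $\varphi(q)$ is the Euler function of $q$. Here (and in what follows), $q$ denotes positive integers, and $p$, $p'$ and $\ell$ denote rational primes.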

The structure of the paper is as follows. In Section 2, we present the heuristic reasoning behind Koblitz's Conjecture and discuss the constant $\mathfrak{C}$. In Section 3, we reduce the statements of Theorems 1 and 2 to an average of Kronecker class numbers (Proposition 7). In Section 4, we show how an average of the twin prime conjecture implies Proposition 7. Finally, in Sections 5 and 6, we give the proofs of the afore-mentioned average of the twin prime conjecture and of Theorem 3.



2 Average of Koblitz's Conjecture and the conjectural constant 

The constant $C(E)$ of Koblitz's Conjecture is based on the following heuristic argument, which is reminiscent of the argument leading to the classical twin prime constant of Hardy and Littlewood (see, for example, [So]).

We want to count the number of primes $p$ such that $p + 1 - a_p(E)$ is also a prime. For each prime $\ell$, this means that $p + 1 - a_p(E)$ is not divisible by $\ell$. For a random integer $n$, the probability that $\ell \nmid n$ would be $(\ell - 1)/\ell$. To compute the probability that $\ell \nmid p + 1 - a_p(E)$ we consider the action of $\mathrm{Gal}(\overline{\mathbb{Q}}/\mathbb{Q})$ on the group $E[\ell]$ of $\ell$-torsion points of $E$, which leads to the injection

$$\rho_\ell : \mathrm{Gal}(\mathbb{Q}(E[\ell])/\mathbb{Q}) \hookrightarrow \mathrm{GL}_2(\mathbb{Z}/\ell\mathbb{Z});$$

here, $\mathbb{Q}(E[\ell])$ is the field obtained by adjoining to $\mathbb{Q}$ the coordinates of the points in $E[\ell]$. By studying the action of the Frobenius map on the torsion points of $E$, it follows that

$$\mathrm{tr}(\rho_\ell(\sigma_p)) \equiv a_p(E) \pmod{\ell}, \qquad \det(\rho_\ell(\sigma_p)) \equiv p \pmod{\ell},$$

for all primes $p \neq \ell$ of good reduction for $E$, where $\sigma_p$ denotes a Frobenius element at $p$. Then the probability that $\ell \nmid p + 1 - a_p(E)$ can be evaluated by counting matrices $g$ in $\mathrm{GL}_2(\mathbb{Z}/\ell\mathbb{Z})$ such that $\det(g) + 1 - \mathrm{tr}(g) \not\equiv 0 \pmod{\ell}$.
Let $G(\ell)$ be the image of $\rho_\ell$ in $\mathrm{GL}_2(\mathbb{Z}/\ell\mathbb{Z})$ and let

$$\Omega(\ell) := \{g \in G(\ell) : \det(g) + 1 - \mathrm{tr}(g) \equiv 0 \pmod{\ell}\},$$
$$\Omega'(\ell) := \{g \in \mathrm{GL}_2(\mathbb{Z}/\ell\mathbb{Z}) : \det(g) + 1 - \mathrm{tr}(g) \equiv 0 \pmod{\ell}\}.$$

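Then, at each prime $\ell$, the correcting probability factor is the quotient

$$\frac{1 - \dfrac{|\Omega(\ell)|}{|G(\ell)|}}{1 - \dfrac{1}{\ell}},$$

where the numerator is the probability that $p + 1 - a_p(E)$ is not divisible by $\ell$ and the denominator is the probability that a random integer is not divisible by $\ell$.
If $G(\ell) = \mathrm{GL}_2(\mathbb{Z}/\ell\mathbb{Z})$, then we have

$$\frac{1 - \dfrac{|\Omega(\ell)|}{|G(\ell)|}}{1 - \dfrac{1}{\ell}} = \frac{1 - \dfrac{|\Omega'(\ell)|}{|\mathrm{GL}_2(\mathbb{Z}/\ell\mathbb{Z})|}}{1 - \dfrac{1}{\ell}} = 1 - \frac{\ell^2 - \ell - 1}{(\ell-1)^3(\ell+1)}.$$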
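The local factor in the surjective case can be confirmed by exhaustive enumeration for small primes. The sketch below (our own check, with illustrative function names) counts the matrices $g \in \mathrm{GL}_2(\mathbb{Z}/\ell\mathbb{Z})$ with $\det(g) + 1 - \mathrm{tr}(g) \equiv 0 \pmod{\ell}$, that is, those having eigenvalue 1, and compares the resulting quotient with the closed form. The count $\ell(\ell^2 - 2)$ used in the comparison is the standard formula for matrices with eigenvalue 1 and is equivalent to the displayed identity.

```python
from fractions import Fraction
from itertools import product

def omega_prime_size(ell):
    """Number of g in GL_2(Z/ell) with det(g) + 1 - tr(g) = 0 mod ell."""
    count = 0
    for a, b, c, d in product(range(ell), repeat=4):
        det = (a * d - b * c) % ell
        if det != 0 and (det + 1 - (a + d)) % ell == 0:
            count += 1
    return count

def gl2_size(ell):
    """Order of GL_2(Z/ell) for a prime ell."""
    return (ell ** 2 - 1) * (ell ** 2 - ell)

def local_factor_from_count(ell):
    """(1 - |Omega'(ell)| / |GL_2|) / (1 - 1/ell), computed exactly."""
    return (1 - Fraction(omega_prime_size(ell), gl2_size(ell))) / (1 - Fraction(1, ell))

def local_factor_closed_form(ell):
    """1 - (ell^2 - ell - 1) / ((ell - 1)^3 (ell + 1))."""
    return 1 - Fraction(ell ** 2 - ell - 1, (ell - 1) ** 3 * (ell + 1))

if __name__ == "__main__":
    for ell in (2, 3, 5, 7):
        print(ell, omega_prime_size(ell), local_factor_from_count(ell))
```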



The constant $C(E)$ of [Ko] is defined as the product over all primes $\ell$ of the local factors above. In [Zy], Zywina made the observation that the probabilities are not independent from one prime to another, because the fields $\mathbb{Q}(E[\ell])$ are never independent for all primes $\ell$, as already observed by Serre in [Se]. Their dependence can be quantified: for each elliptic curve $E/\mathbb{Q}$, there is an integer $M_E$ which has the property that the probabilities are independent for primes $\ell \nmid M_E$ (see [Zy] for a precise definition).

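Now let $G(M_E) \subseteq \mathrm{GL}_2(\mathbb{Z}/M_E\mathbb{Z})$ be the Galois group of $\mathbb{Q}(E[M_E])/\mathbb{Q}$ and let

$$\Omega(M_E) := \{g \in G(M_E) : \det(g) + 1 - \mathrm{tr}(g) \equiv 0 \pmod{M_E}\}.$$

For each elliptic curve $E$, the constant $C(E)$ of Koblitz's Conjecture is then expected to be

$$C(E) = \frac{1 - \dfrac{|\Omega(M_E)|}{|G(M_E)|}}{\displaystyle\prod_{\ell \mid M_E} \left(1 - \frac{1}{\ell}\right)} \times \prod_{\ell \nmid M_E} \left(1 - \frac{\ell^2 - \ell - 1}{(\ell-1)^3(\ell+1)}\right). \tag{2}$$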



We remark that even if $M_E$ is never 1, it is possible for some elliptic curves $E/\mathbb{Q}$ to have

$$C(E) = \prod_{\ell} \left(1 - \frac{\ell^2 - \ell - 1}{(\ell-1)^3(\ell+1)}\right),$$

as shown in [Jo] and [Zy] (it is shown in [Jo] that $C(E) = \mathfrak{C}$ when the square-free part of the discriminant of $E$ is congruent to 2 or 3 modulo 4). However, the average constant $\mathfrak{C}$ of Theorem 1 should not be thought of as the constant of any given curve over $\mathbb{Q}$, but as the average of all the constants $C(E)$. Indeed, in [Jo], N. Jones shows that if one assumes a positive answer to a well-known question of Serre regarding the open image theorem for elliptic curves proven in [Se], then the average of the conjectural constants $C(E)$ of (2) is indeed the average constant of Theorem 1. Our result then gives evidence for both the asymptotic of Koblitz's Conjecture and the constant appearing in the conjecture.

3 Reduction to an average of Kronecker class numbers and proofs of Theorems 1 and 2

Let $x > 0$ and let $\mathcal{C}$ be the family of elliptic curves introduced in Theorem 1. In this section we show how the average of $\pi_E^{\mathrm{twin}}(x)$ over $E/\mathbb{Q}$ reduces to an average of Kronecker class numbers, which is Proposition 7 of the present section.

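First we write

$$\frac{1}{|\mathcal{C}|} \sum_{E \in \mathcal{C}} \pi_E^{\mathrm{twin}}(x) = \frac{1}{|\mathcal{C}|} \sum_{p \le x} \sum_{\substack{|r| \le 2\sqrt{p} \\ p+1-r \text{ prime}}} N_{A,B,r}(p), \tag{3}$$

where

$$N_{A,B,r}(p) := \#\{|a| \le A,\ |b| \le B : a_p(E(a,b)) = r\}.$$

As the value of $a_p(E(a,b))$ depends only on $a$ and $b$ modulo $p$, $N_{A,B,r}(p)$ can be evaluated by counting elliptic curves over the finite field $\mathbb{F}_p$ having $p + 1 - r$ points for each given $r$. In turn, this can be done using the following results: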



Theorem 4 (Deuring's Theorem [De])

Let $p > 3$ be a prime and let $r$ be an integer such that $r^2 - 4p < 0$. Let $\mathcal{E}_r(p)$ be the set of $\mathbb{F}_p$-isomorphism classes of elliptic curves over $\mathbb{F}_p$ having $p + 1 - r$ $\mathbb{F}_p$-rational points. Then

$$\sum_{E \in \mathcal{E}_r(p)} \frac{1}{|\mathrm{Aut}(E)|} = H(r^2 - 4p),$$

where $\mathrm{Aut}(E)$ is the automorphism group of $E$ and for any $D < 0$, $H(D)$ is the Kronecker class number

$$H(D) := \sum_{\substack{f^2 \mid D \\ D/f^2 \equiv 0, 1 \,(\mathrm{mod}\, 4)}} \frac{h(D/f^2)}{w(D/f^2)}$$

defined in terms of the class number $h(D/f^2)$ and number of units $w(D/f^2)$ of $\mathbb{Q}(\sqrt{D/f^2})$.
Then, for any fixed $-2\sqrt{p} < r < 2\sqrt{p}$, there are exactly $(p-1)H(r^2 - 4p)$ elliptic curves defined over $\mathbb{F}_p$ with $p + 1 - r$ points.
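The counting statement can be tested directly for small primes. The sketch below is our own check under stated assumptions: $h(d)$ is computed as the number of primitive reduced binary quadratic forms of discriminant $d$, $w(d)$ is taken to be 6, 4 or 2 according to whether $d = -3$, $d = -4$ or otherwise, and "elliptic curves defined over $\mathbb{F}_p$" is read as pairs $(a,b) \in \mathbb{F}_p^2$ with $4a^3 + 27b^2 \neq 0$. All function names are illustrative.

```python
from fractions import Fraction
from math import gcd, isqrt

def class_number(d):
    """h(d): number of primitive reduced forms (a, b, c) with b^2 - 4ac = d < 0."""
    count = 0
    a = 1
    while 3 * a * a <= -d:
        for b in range(-a + 1, a + 1):       # -a < b <= a
            num = b * b - d
            if num % (4 * a):
                continue
            c = num // (4 * a)
            if c < a or (a == c and b < 0):  # reduced: a <= c, and b >= 0 if a == c
                continue
            if gcd(gcd(a, b), c) == 1:
                count += 1
        a += 1
    return count

def units(d):
    """w(d): number of units in the quadratic order of discriminant d < 0."""
    return {-3: 6, -4: 4}.get(d, 2)

def kronecker_class_number(D):
    """H(D): sum of h(D/f^2)/w(D/f^2) over f^2 | D with D/f^2 = 0 or 1 mod 4."""
    total = Fraction(0)
    for f in range(1, isqrt(-D) + 1):
        if D % (f * f) == 0 and (D // (f * f)) % 4 in (0, 1):
            d = D // (f * f)
            total += Fraction(class_number(d), units(d))
    return total

def curves_with_trace(p):
    """Map r -> number of (a, b) mod p, non-singular, with a_p(E(a, b)) = r."""
    squares = {(y * y) % p for y in range(1, p)}
    tally = {}
    for a in range(p):
        for b in range(p):
            if (4 * a ** 3 + 27 * b ** 2) % p == 0:
                continue
            affine = 0
            for x in range(p):
                v = (x ** 3 + a * x + b) % p
                affine += 1 if v == 0 else (2 if v in squares else 0)
            r = p - affine  # |E(F_p)| = affine + 1 = p + 1 - r
            tally[r] = tally.get(r, 0) + 1
    return tally

if __name__ == "__main__":
    p = 7
    for r, n in sorted(curves_with_trace(p).items()):
        print(r, n, (p - 1) * kronecker_class_number(r * r - 4 * p))
```

Summing over $r$ recovers the total number $p^2 - p$ of non-singular pairs $(a,b)$, which gives a quick consistency check on the class number computation.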

Lemma 5 Let $D$ be a positive integer such that $-D \equiv 0, 1 \pmod 4$. Then, as $D \to \infty$,

$$H(-D) \ll D^{1/2} \log^2 D. \tag{4}$$

Proof. This follows from the class number formula and from standard bounds on special values of Dirichlet L-functions. □

By reducing modulo $p$ and counting curves over $\mathbb{F}_p$, it follows that

$$N_{A,B,r}(p) = \left(\frac{2A}{p} + O(1)\right)\left(\frac{2B}{p} + O(1)\right)\left(pH(r^2 - 4p) + O\left(\sqrt{p}\log^2 p\right)\right). \tag{5}$$

This is the approach of [DaPa] which gives an average of size $A, B > x^{1+\varepsilon}$ when one replaces (5) in (3). In order to get the shorter average of Theorem 1 one needs to count $N_{A,B,r}(p)$ in a more subtle way, using character sums. This was done by Baier, who proved the following lemma:

Lemma 6 Let $N_{A,B,r}(p)$ be the number of curves $E(a,b) \in \mathcal{C}$ such that $a_p(E(a,b)) = r$. Then,

AAB-H(r 2 -4p) n ( AB AB-H(r 2 -Ap) . n 

Na,b,Ap) = " — + + \ ^i + A + B 

p \ p p A 

+(AB ■ H(r 2 - 4p))V2 i og 3 p + (d + Z ^ l ogp \ . 

Proof. See [Bai]. □

Replacing Lemma 6 in (3) and using Lemma 5 to bound the Kronecker class numbers, we obtain that

1 V- twin/ \ v- H(r 2 - 4p) / 1/2 x 3 / 2 log 3 x x 3 / 2 log 3 x x 7 / 4 log 4 x \ 

\c\^/ E {x) ~ ^ p +u r + A + A + (AbW 2 J 

— r prime 

p<x, |r|<2^ 
— r prime 

provided we take $A, B$ such that $A, B > x^{1/2+2\varepsilon}$ and $AB > x^{3/2+2\varepsilon}$.

In a similar way, using Deuring's Theorem and the bound of Lemma 5, we can write

$$\sum_{p \le x} \pi^*(p) = \sum_{p \le x} \sum_{\substack{|r| \le 2\sqrt{p} \\ p+1-r \text{ prime}}} pH(r^2 - 4p) + O\left(x^2 \log^2 x\right). \tag{7}$$

Thus, once again, we need to evaluate an average of class numbers. Then, Theorems 1 and 2 will follow from:



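Proposition 7 Let $x, X, Y$ be positive real numbers such that $X + Y \le x$, $Y \ge \sqrt{X}$. Then, for any $M > 0$,

$$\sum_{X < p \le X+Y} \sum_{\substack{|r| \le 2\sqrt{X} \\ p+1-r \text{ prime}}} pH(r^2 - 4p) = \frac{\mathfrak{C} X^2 Y}{\log^2 X} + O\left(XY^2 \log X\right) + O\left(\frac{x^3}{\log^M x}\right),$$

where $\mathfrak{C}$ is the constant of Theorem 1.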

Now let us indicate in detail how Proposition 7 implies Theorems 1 and 2.
Proof of Theorems 1 and 2. Comparing (6) and (7), we see that the two asymptotics to prove are equivalent by partial summation; we will only prove the second one.

Let $M$ be any positive integer (e.g. $M = 10$ suffices) and $K := [\log^{M/2} x]$. Let

$$Y := \frac{x}{K} \quad \text{and} \quad X := kY \ \text{ for } 0 \le k \le K - 1.$$

We partition the interval $p \le x$ into $K$ intervals of length $Y$ and rewrite the main term of (7) as

$$\begin{aligned} \sum_{p \le x} \sum_{\substack{|r| \le 2\sqrt{p} \\ p+1-r \text{ prime}}} pH(r^2 - 4p) &= \sum_{0 \le k \le K-1} \sum_{X < p \le X+Y} \sum_{\substack{|r| \le 2\sqrt{p} \\ p+1-r \text{ prime}}} pH(r^2 - 4p) \\ &= \sum_{0 \le k \le K-1} \sum_{X < p \le X+Y} \sum_{\substack{|r| \le 2\sqrt{X} \\ p+1-r \text{ prime}}} pH(r^2 - 4p) + O\Bigg(\sum_{0 \le k \le K-1} XY^2 \log^2 x\Bigg), \end{aligned} \tag{8}$$

where the O-term comes from the bound on the Kronecker class number $H(r^2 - 4p)$ for $r$ in the interval $2\sqrt{X} < |r| \le 2\sqrt{p}$ and is bounded by $K^2 Y^3 \log^2 x \ll x^3 \log^{2 - M/2} x$. For the main term we use Proposition 7 with the same $M$, and obtain:


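$$\begin{aligned} \sum_{0 \le k \le K-1} \sum_{X < p \le X+Y} \sum_{\substack{|r| \le 2\sqrt{X} \\ p+1-r \text{ prime}}} pH(r^2 - 4p) &= \mathfrak{C}\,Y^3 \sum_{1 \le k \le K-1} \frac{k^2}{\log^2(kY)} + O\left(\frac{x^3}{\log^{M/2-1} x}\right) \\ &= \mathfrak{C}\,Y^3 \int_1^K \frac{t^2}{\log^2(tY)}\,dt + O\left(\frac{x^3}{\log^{M/2-1} x}\right) \\ &= \mathfrak{C} \int_Y^x \frac{u^2}{\log^2 u}\,du + O\left(\frac{x^3}{\log^{M/2-1} x}\right) \\ &= \frac{\mathfrak{C}\,x^3}{3 \log^2 x} + O\left(\frac{x^3}{\log^3 x}\right). \end{aligned} \tag{9}$$

Replacing (8) and (9) in (7), the proof of Theorem 2 is completed. The proof of Theorem 1 follows by partial summation. □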
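The last evaluation of the integral in the proof above can be checked by one integration by parts. The following is a brief sketch, assuming $2 \le Y \le x$; the splitting of the remaining integral at $\sqrt{x}$ is our own choice of argument and is not taken from the text.

```latex
\begin{align*}
\int_Y^x \frac{u^2}{\log^2 u}\,du
  &= \left[\frac{u^3}{3\log^2 u}\right]_Y^x
     + \frac{2}{3}\int_Y^x \frac{u^2}{\log^3 u}\,du, \\
\int_Y^x \frac{u^2}{\log^3 u}\,du
  &\le \int_2^{\sqrt{x}} \frac{u^2}{\log^3 u}\,du
     + \int_{\sqrt{x}}^x \frac{u^2}{\log^3 u}\,du
   \ll x^{3/2} + \frac{x^3}{\log^3 x}, \\
\int_Y^x \frac{u^2}{\log^2 u}\,du
  &= \frac{x^3}{3\log^2 x}
     + O\!\left(\frac{Y^3}{\log^2 Y}\right)
     + O\!\left(\frac{x^3}{\log^3 x}\right).
\end{align*}
```

When $Y$ is smaller than $x$ by a sufficiently large power of $\log x$, the term $Y^3/\log^2 Y$ is absorbed into the last error term.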



4 Reduction to an average twin prime conjecture and proof of Proposition 7

In this section we show how Proposition 7 reduces to an average of the twin prime conjecture which, in turn, will be proved completely in Sections 5 and 6. To be precise, our proof of Proposition 7 relies on the validity of the following result:

Proposition 8 Let $x > 0$ and let $M > 0$. Let $X, Y, R, U, V$ be parameters depending on $x$ and satisfying

$$X + Y \le x, \quad R \le x, \quad x^{1/2} \log^N x \le U, \quad \log^N x \le V, \quad UV^2 \le x \log^{-N} x.$$

There exists an integer $N(M) > 0$ such that, if $N > N(M)$, then, as $x \to \infty$,

$$\sum_{\substack{|r| \le R \\ f \le V \\ n \le U}} \frac{1}{nf} \sum_{a \,(\mathrm{mod}\, 4n)} \left(\frac{a}{n}\right) \sum_{\substack{X < p \le X+Y \\ p+1-r \text{ prime} \\ p \equiv (r^2 - af^2)/4 \,(\mathrm{mod}\, nf^2)}} \log p \cdot \log(p + 1 - r) = 2\mathfrak{C}RY + O\left(\frac{Rx}{\log^M x} + x^{4/3+\varepsilon}\right), \tag{10}$$

where $\mathfrak{C}$ is the constant of Theorem 1.

We assume this result as true and proceed to proving Proposition 7.

Proof of Proposition 7. Let $x, X, Y$ be as in the statement of Proposition 7. Using the class number formula, we write

$$\sum_{X < p \le X+Y} \sum_{\substack{|r| \le 2\sqrt{X} \\ p+1-r \text{ prime}}} pH(r^2 - 4p) = \frac{1}{2\pi} \sum_{\substack{|r| \le 2\sqrt{X} \\ f \le 2\sqrt{x}}} \frac{1}{f} \sideset{}{^*}\sum_{X < p \le X+Y} p\sqrt{4p - r^2}\,L(1, \chi_d), \tag{11}$$

where $d = d(r,p,f) := (r^2 - 4p)/f^2$ and the $*$ on the summation over $p$ indicates that we are summing over primes $X < p \le X + Y$ such that

$$p + 1 - r \text{ prime}, \quad f^2 \mid r^2 - 4p, \quad \text{and} \quad d \equiv 0, 1 \pmod 4.$$

Here, $\chi_d$ denotes the Dirichlet character modulo $d$ defined by the Kronecker symbol and $L(s, \chi_d)$ denotes its Dirichlet L-function.

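Using the Pólya-Vinogradov inequality, we write the special value $L(1, \chi_d)$ as

$$L(1, \chi_d) = \sum_{n \le U} \frac{\chi_d(n)}{n} + \sum_{n > U} \frac{\chi_d(n)}{n} = \sum_{n \le U} \frac{\chi_d(n)}{n} + O\left(\frac{\sqrt{|d|}\,\log|d|}{U}\right),$$

where $U = U(x)$ is a parameter to be chosen soon. By plugging this into (11), we obtain that

$$\sum_{X < p \le X+Y} \sum_{\substack{|r| \le 2\sqrt{X} \\ p+1-r \text{ prime}}} pH(r^2 - 4p) = \frac{1}{2\pi} \sum_{\substack{|r| \le 2\sqrt{X} \\ f \le 2\sqrt{x} \\ n \le U}} \frac{1}{nf} \sideset{}{^*}\sum_{X < p \le X+Y} p\sqrt{4p - r^2}\,\chi_d(n) + O\left(\frac{x^{7/2} \log x}{U}\right). \tag{12}$$

Thus, by taking

$$x^{1/2} \log^{M+1} x \le U \le x, \tag{13}$$

the O-term above becomes $O(x^3/\log^M x)$.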
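The truncation of $L(1,\chi_d)$ and the class number formula $h(d)/w(d) = \sqrt{|d|}\,L(1,\chi_d)/(2\pi)$ can be illustrated numerically. The sketch below is our own example and makes a simplifying assumption: it only treats $d = -p$ with $p \equiv 3 \pmod 4$ prime, for which the Kronecker symbol $\chi_{-p}(n)$ coincides with the Legendre symbol $(n/p)$. The function names are illustrative.

```python
from math import pi, sqrt

def legendre(n, p):
    """Legendre symbol (n/p) for an odd prime p, via Euler's criterion."""
    n %= p
    if n == 0:
        return 0
    return 1 if pow(n, (p - 1) // 2, p) == 1 else -1

def partial_L1(p, U):
    """Truncated series sum_{n <= U} chi(n)/n with chi(n) = (n/p)."""
    return sum(legendre(n, p) / n for n in range(1, U + 1))

def L1_from_class_number(p, h, w=2):
    """L(1, chi_{-p}) predicted by the class number formula h/w = sqrt(p) L / (2 pi)."""
    return 2 * pi * h / (w * sqrt(p))

if __name__ == "__main__":
    # Class numbers h(-7) = 1 and h(-23) = 3 are standard values.
    for p, h in ((7, 1), (23, 3)):
        print(p, partial_L1(p, 1000 * p), L1_from_class_number(p, h))
```

The tail beyond $U$ is controlled by the boundedness of the character sums, which is exactly the role of the Pólya-Vinogradov inequality in the display above.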

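Now let us also truncate the sum over $f$. We write

$$\begin{aligned} \frac{1}{2\pi} \sum_{\substack{|r| \le 2\sqrt{X} \\ f \le 2\sqrt{x} \\ n \le U}} \frac{1}{nf} \sideset{}{^*}\sum_{X < p \le X+Y} p\sqrt{4p - r^2}\,\chi_d(n) &= \frac{1}{2\pi} \sum_{\substack{|r| \le 2\sqrt{X} \\ f \le V \\ n \le U}} \frac{1}{nf} \sideset{}{^*}\sum_{X < p \le X+Y} p\sqrt{4p - r^2}\,\chi_d(n) \\ &\quad + O\Bigg(x^{3/2} \log U \sum_{|r| \le 2\sqrt{X}} \sum_{V < f \le 2\sqrt{x}} \frac{1}{f} \sum_{\substack{X < p \le X+Y \\ p \equiv \bar{4} r^2 \,(\mathrm{mod}\, f^2)}} 1\Bigg) \\ &= \frac{1}{2\pi} \sum_{\substack{|r| \le 2\sqrt{X} \\ f \le V \\ n \le U}} \frac{1}{nf} \sideset{}{^*}\sum_{X < p \le X+Y} p\sqrt{4p - r^2}\,\chi_d(n) + O\left(\frac{x^3 \log U}{V^2}\right). \end{aligned} \tag{14}$$

On the second line we used that since $r$ is odd and $f^2 \mid r^2 - 4p$, we must have that $f$ is odd, hence the condition in the sum over $p$ that $4p \equiv r^2 \pmod{f^2}$ becomes $p \equiv \bar{4} r^2 \pmod{f^2}$, where $\bar{4}$ is the inverse of 4 modulo $f^2$. The other estimates used for the O-term are elementary.
We choose $V$ such that

$$V \ge (\log x)^{(M+1)/2} \tag{15}$$

and then the O-term above becomes $O(x^3/\log^M x)$.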

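Now we use quadratic reciprocity and consider $\chi_d(n)$ as a character modulo $4n$. Hence we rewrite the main term of (14) as

$$\frac{1}{2\pi} \sum_{\substack{|r| \le 2\sqrt{X} \\ f \le V \\ n \le U}} \frac{1}{nf} \sum_{a \,(\mathrm{mod}\, 4n)} \left(\frac{a}{n}\right) \sideset{}{^{**}}\sum_{X < p \le X+Y} p\sqrt{4p - r^2},$$

where the $**$ on the summation over $p$ indicates that we are summing over primes $X < p \le X + Y$ such that

$$p + 1 - r \text{ prime}, \quad f^2 \mid r^2 - 4p, \quad d \equiv 0, 1 \pmod 4, \quad \text{and} \quad \frac{r^2 - 4p}{f^2} \equiv a \pmod{4n}.$$

Since $r$ and $f$ must be odd, we necessarily have $d \equiv 1 \pmod 4$; thus $**$ is equivalent to the conditions

$$p + 1 - r \text{ prime} \quad \text{and} \quad p \equiv \frac{r^2 - af^2}{4} \pmod{nf^2}.$$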



We now change the weight of the primes $p$ from $p\sqrt{4p - r^2}$ to

$$X\sqrt{4X - r^2}\;\frac{\log p \cdot \log(p + 1 - r)}{\log^2 X}.$$

Then the main term of (14) becomes

$$\frac{X}{2\pi \log^2 X} \sum_{\substack{|r| \le 2\sqrt{X} \\ f \le V \\ n \le U}} \frac{\sqrt{4X - r^2}}{nf} \sum_{a \,(\mathrm{mod}\, 4n)} \left(\frac{a}{n}\right) \sideset{}{^{**}}\sum_{X < p \le X+Y} \log p \cdot \log(p + 1 - r) + O\left(XY^2 \log X\right) \tag{16}$$

and so we reduced our question to an average of the standard twin prime conjecture, twisted by some Kronecker symbols. This average is evaluated using Proposition 8 stated in the beginning of this section; the details follow.

Let us write the left hand side of Proposition 8 as $\sum_{|r| \le R} F(r)$. With this notation, the main term of (16) becomes

$$\frac{X}{2\pi \log^2 X} \sum_{|r| \le 2\sqrt{X}} F(r)\sqrt{4X - r^2},$$

which we can compute from Proposition 8 by partial summation. We obtain:

$$\frac{X}{2\pi \log^2 X} \sum_{|r| \le 2\sqrt{X}} F(r)\sqrt{4X - r^2} = \frac{\mathfrak{C}\,XY}{\pi \log^2 X} \int_0^{2\sqrt{X}} t^2 (4X - t^2)^{-1/2}\,dt + O\left(\frac{x^3}{(\log^2 X)(\log^M x)}\right).$$

Evaluating the integral

$$\int_0^{2\sqrt{X}} t^2 (4X - t^2)^{-1/2}\,dt = 4X \int_0^1 t^2 (1 - t^2)^{-1/2}\,dt = \pi X$$

and replacing in the above, we obtain

$$\frac{X}{2\pi \log^2 X} \sum_{|r| \le 2\sqrt{X}} F(r)\sqrt{4X - r^2} = \frac{\mathfrak{C}\,X^2 Y}{\log^2 X} + O\left(\frac{x^3}{(\log^2 X)(\log^M x)}\right).$$

Using this together with (16), the proof of Proposition 7 is now completed (provided that Proposition 8 holds). □
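For completeness, the value $\pi X$ of the integral used in the proof above can be verified with the substitution $t = 2\sqrt{X}\sin\theta$ (a routine check that we add here; it is not spelled out in the text):

```latex
\begin{align*}
\int_0^{2\sqrt{X}} \frac{t^2}{\sqrt{4X - t^2}}\,dt
  &= \int_0^{\pi/2} \frac{4X\sin^2\theta}{2\sqrt{X}\cos\theta}\,
       2\sqrt{X}\cos\theta\,d\theta
   = 4X\int_0^{\pi/2}\sin^2\theta\,d\theta
   = 4X\cdot\frac{\pi}{4}
   = \pi X .
\end{align*}
```

Combined with the prefactor $\mathfrak{C}\,XY/(\pi\log^2 X)$, this gives the main term $\mathfrak{C}\,X^2 Y/\log^2 X$ of Proposition 7.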



5 Average of the twin prime conjecture and proof of Theorem 3

In this section we shall prove Theorem 3. The statement is a Barban-Davenport-Halberstam type distribution result for twin primes, where the average is over the twin prime differences. The main (and difficult) part of the proof is the case $Q = 1$ of Theorem 3. A version of this was proven by Perelli and Pintz in [PePi]. Besides minor cosmetic changes, their result differs from what we need in two aspects: rather than a Goldbach type problem, we have a twin prime problem; more importantly, we need a Siegel-Walfisz type analogue, namely:

Proposition 9 Let $\varepsilon, M, N > 0$ be fixed. Then there exists $x(\varepsilon, M, N) > 0$ such that, for any $x > x(\varepsilon, M, N)$, $x^{1/3+\varepsilon} \le R \le x$, $q \le \log^N x$, $(a,q) = 1$ and $0 < X < X + Y \le x$, we have

$$\sum_{0 < r \le R} \Bigg| \sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q) \\ p - p' = r}} \log p \cdot \log p' - \mathfrak{S}(r,q,a)\,Y \Bigg|^2 \ll \frac{R x^2}{\log^M x}.$$

The proof of Proposition 9 is similar to the proof of [PePi, Theorem 1]. For this reason, we will only indicate the major steps that should enable the interested reader to modify [PePi] accordingly. After we complete the proof of Proposition 9, we proceed to proving Theorem 3.



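Proof of Proposition 9. We use the following notation (which is, unfortunately, not exactly the same as in [PePi]):

$$S_1(\alpha) := \sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q)}} \log p \cdot e(p\alpha), \qquad S_2(\alpha) := \sum_{p' \le x} \log p' \cdot e(p'\alpha), \qquad e(y) := e^{2\pi i y},$$

$$C := C(\varepsilon, M, N), \qquad I_{s,b} := \text{Farey arc around } \frac{b}{s} = \left\{\frac{b}{s} + \eta : |\eta| \le \frac{\log^{2C} x}{sx}\right\},$$

$$\mathfrak{M} := \bigcup_{s \le \log^C x}\ \bigcup_{(b,s) = 1} I_{s,b}, \qquad \mathfrak{m} := [0,1] \setminus \mathfrak{M}.$$

By the circle method, we have

$$\sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q) \\ p - p' = r}} \log p \cdot \log p' = \int_0^1 S_1(\alpha) S_2(-\alpha) e(-r\alpha)\,d\alpha,$$

and so

$$\begin{aligned} \sum_{0 < r \le R} \Bigg| \sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q) \\ p - p' = r}} \log p \cdot \log p' - \mathfrak{S}(r,q,a)\,Y \Bigg|^2 &\ll \sum_{0 < r \le R} \left| \int_{\mathfrak{m}} S_1(\alpha) S_2(-\alpha) e(-r\alpha)\,d\alpha \right|^2 \\ &\quad + \sum_{0 < r \le R} \left| \int_{\mathfrak{M}} S_1(\alpha) S_2(-\alpha) e(-r\alpha)\,d\alpha - \mathfrak{S}(r,q,a)\,Y \right|^2. \end{aligned} \tag{17}$$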



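The estimate for the contribution of the minor arcs $\mathfrak{m}$, the first term in formula (17), is identical to the one in [PePi]. To start, we remark that the Cauchy-Schwarz inequality and the well-known estimate $\sum_{0 < r \le R} e(rv) \ll \min(R, 1/\|v\|)$ reduce this term to

$$\begin{aligned} \sum_{0 < r \le R} \left| \int_{\mathfrak{m}} S_1(\alpha) S_2(-\alpha) e(-r\alpha)\,d\alpha \right|^2 &= \sum_{0 < r \le R} \int_{\mathfrak{m}} S_1(\alpha) S_2(-\alpha) e(-r\alpha)\,d\alpha \int_{\mathfrak{m}} \overline{S_1(\beta) S_2(-\beta)}\,e(r\beta)\,d\beta \\ &\ll \int_{\mathfrak{m}} |S_1(\beta) S_2(\beta)| \int_{\mathfrak{m}} |S_1(\alpha) S_2(\alpha)| \min\left(R, \frac{1}{\|\alpha - \beta\|}\right) d\alpha\,d\beta \\ &\ll \int_{\mathfrak{m}} |S_1(\beta) S_2(\beta)|\,d\beta \left(\int_0^1 |S_1(\alpha)|^2\,d\alpha\right)^{1/2} \sup_{\beta \in \mathfrak{m}} \left(\int_{\mathfrak{m}} |S_2(\alpha)|^2 \min\left(R, \frac{1}{\|\alpha - \beta\|}\right)^2 d\alpha\right)^{1/2}, \end{aligned}$$

where $\|\cdot\|$ denotes the distance to the nearest integer. Now let us observe that our $S_2(\alpha)$ is exactly the same as the one in [PePi], thus the third integral above can be estimated as in [PePi, Section 5]. Note that the function $S_2(\alpha)$ plays the crucial role, while the somewhat different $S_1(\alpha)$ only appears in Parseval's identity. Since our $S_1(\alpha)$ has smaller $L^2$-norm than its analogue in [PePi], the arguments in [PePi, Section 3] provide the necessary bound in our case as well.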

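For the calculation of the major arcs, the second term in (17), we follow the exact steps of [PePi, Section 4]. First, for $\alpha = \frac{b}{s} + \eta \in I_{s,b}$, we use the Siegel-Walfisz theorem to approximate the function

$$S_1(\alpha) = \sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q)}} \log p \cdot e\left(p\left(\frac{b}{s} + \eta\right)\right) = \sum_{1 \le c \le s} e\left(\frac{bc}{s}\right) \sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q) \\ p \equiv c \,(\mathrm{mod}\, s)}} \log p \cdot e(p\eta)$$

by

$$\frac{1}{\varphi([q,s])} \sum_{\substack{1 \le c \le s \\ (c,s) = 1 \\ (q,s) \mid c - a}} e\left(\frac{bc}{s}\right) \sum_{X < n \le X+Y} e(n\eta)$$

and the function $S_2(\alpha)$ by

$$\frac{1}{\varphi(s)} \sum_{\substack{1 \le c \le s \\ (c,s) = 1}} e\left(\frac{bc}{s}\right) \sum_{n \le x} e(n\eta) = \frac{\mu(s)}{\varphi(s)} \sum_{n \le x} e(n\eta).$$

Here, $\mu(\cdot)$ denotes the Möbius function.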

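The estimates for the error terms in our resulting analogue of [PePi, (7)] are identical to the ones described in [PePi, Section 4]. For the main term, the only difference is in the singular series, which now originates in

$$\sum_{s \le \log^C x} \frac{\mu(s)}{\varphi(s)\varphi([q,s])} \sum_{\substack{1 \le b \le s \\ (b,s) = 1}} e\left(\frac{-rb}{s}\right) \sum_{\substack{1 \le c \le s \\ (c,s) = 1 \\ (q,s) \mid c - a}} e\left(\frac{bc}{s}\right). \tag{18}$$

We proceed as follows.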

The standard argument via the Chinese Remainder Theorem shows that the function

$$F(s; r; q, a) := \sum_{\substack{1 \le b \le s \\ (b,s) = 1}} e\left(\frac{-rb}{s}\right) \sum_{\substack{1 \le c \le s \\ (c,s) = 1 \\ (q,s) \mid c - a}} e\left(\frac{bc}{s}\right)$$

is multiplicative in $s$. Indeed, for $s = uv$, $(u,v) = 1$, $u\bar{u} \equiv 1 \pmod v$ and $v\bar{v} \equiv 1 \pmod u$, note that the relation $c = gu\bar{u} + hv\bar{v}$ establishes a bijection between the reduced residue classes $c$ modulo $uv$ and the pairs of reduced residue classes $g$ modulo $v$, $h$ modulo $u$. Similarly, the relation $b = du + fv$ establishes a bijection between the reduced residue classes $b$ modulo $uv$ and the pairs of reduced residue classes $d$ modulo $v$, $f$ modulo $u$. Thus

$$e\left(\frac{bc - rb}{uv}\right) = e\left(\frac{(du + fv)(gu\bar{u} + hv\bar{v}) - r(du + fv)}{uv}\right) = e\left(\frac{dg - rd}{v}\right) e\left(\frac{fh - rf}{u}\right).$$

Moreover, $(q, uv) \mid c - a$ if and only if $(q,u) \mid h - a$ and $(q,v) \mid g - a$.

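Observe that we are only interested in square-free $s$, thus it is enough to know $F(p; r; q, a)$ for a prime $p$. A routine computation shows that

$$F(p; r; q, a) = \sum_{1 \le b \le p-1}\ \sum_{\substack{1 \le c \le p-1 \\ (q,p) \mid c - a}} e\left(\frac{bc - rb}{p}\right) = \begin{cases} p - 1 & \text{if } p \mid q,\ p \mid a - r, \\ -1 & \text{if } p \mid q,\ p \nmid a - r, \\ -p + 1 & \text{if } p \nmid q,\ p \mid r, \\ 1 & \text{if } p \nmid q,\ p \nmid r. \end{cases}$$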
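The four values of $F(p; r; q, a)$, and the multiplicativity in $s$, are easy to confirm numerically. The sketch below is our own check (function names are illustrative); it evaluates the double exponential sum with complex floating-point arithmetic and compares it with the case distinction, assuming $(a,q) = 1$ as in Proposition 9.

```python
import cmath
from math import gcd

def e(y):
    """e(y) = exp(2 pi i y)."""
    return cmath.exp(2j * cmath.pi * y)

def F(s, r, q, a):
    """Double sum over reduced b, c mod s with gcd(q, s) | c - a of e((bc - rb)/s)."""
    g = gcd(q, s)
    total = 0
    for b in range(1, s + 1):
        if gcd(b, s) != 1:
            continue
        for c in range(1, s + 1):
            if gcd(c, s) == 1 and (c - a) % g == 0:
                total += e((b * c - r * b) / s)
    return total

def F_prime_closed_form(p, r, q, a):
    """The four cases of F(p; r; q, a) for a prime p, assuming gcd(a, q) = 1."""
    if q % p == 0:
        return p - 1 if (a - r) % p == 0 else -1
    return -(p - 1) if r % p == 0 else 1

if __name__ == "__main__":
    print(F(5, 3, 10, 3), F_prime_closed_form(5, 3, 10, 3))
    print(F(15, 4, 6, 5), F(3, 4, 6, 5) * F(5, 4, 6, 5))
```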



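Now one can easily check that, after extending the sum over $s$ in (18) up to infinity, we have

$$\sum_{s = 1}^{\infty} \frac{\mu(s)}{\varphi(s)\varphi([q,s])}\,F(s; r; q, a) = \mathfrak{S}(r,q,a).$$

Proposition 9 then follows. □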
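The check of the singular series can be organized prime by prime. The following sketch is our own bookkeeping, using $\varphi([q,s]) = \varphi(q)\varphi(s)/\varphi((q,s))$, the multiplicativity of $F$, and the four values of $F(p; r; q, a)$ computed in the proof:

```latex
\begin{align*}
\sum_{s\ge 1}\frac{\mu(s)\,F(s;r;q,a)}{\varphi(s)\,\varphi([q,s])}
  &= \frac{1}{\varphi(q)}\prod_{p}
     \left(1-\frac{\varphi((q,p))\,F(p;r;q,a)}{(p-1)^2}\right), \\
1-\frac{\varphi((q,p))\,F(p;r;q,a)}{(p-1)^2}
  &= \begin{cases}
       0 & \text{if } p\mid q,\ p\mid a-r,\\[2pt]
       \dfrac{p}{p-1} & \text{if } p\mid q,\ p\nmid a-r,\\[6pt]
       \dfrac{p}{p-1} & \text{if } p\nmid q,\ p\mid r,\\[6pt]
       \dfrac{p(p-2)}{(p-1)^2} & \text{if } p\nmid q,\ p\nmid r.
     \end{cases}
\end{align*}
```

With $(a,q) = 1$, the product vanishes unless $(a - r, q) = 1$ and $2 \mid r$; otherwise it has the factor $p/(p-1)$ at the primes $p \mid rq$ and $p(p-2)/(p-1)^2$ at the remaining primes, which is $\mathfrak{S}(rq)$ as defined in (1).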



Proof of Theorem 3. Let us observe that the expected density of twin primes of distance $r$ is $\mathfrak{S}(r)$. If $a \pmod q$ is an admissible residue class, then the expected density of twin primes of distance $r$ in the residue class $a \pmod q$ should satisfy

$$\mathfrak{S}(r,q,a) = \frac{\mathfrak{S}(r)}{\rho(r,q)}$$

whenever $(a,q) = (a - r, q) = 1$, where

$$\rho(r,q) := \#\{a \,(\mathrm{mod}\, q) : (a,q) = (a - r, q) = 1\}.$$

To see this, let us evaluate $\rho(r,q)$. On one hand, we have that this function is multiplicative in the second variable $q$. Indeed, let $q = uv$ with $(u,v) = 1$ and note that by the Chinese Remainder Theorem, the relation $b = cu\bar{u} + dv\bar{v}$ establishes a bijection between the reduced residue classes $b$ modulo $uv$ and the pairs of reduced residue classes $c$ modulo $v$, $d$ modulo $u$. Here, $u\bar{u} \equiv 1 \pmod v$ and $v\bar{v} \equiv 1 \pmod u$. The multiplicativity then follows from the fact that $(b - r, uv) = 1$ if and only if $(c - r, v) = (d - r, u) = 1$. On the other hand, we have that

$$\rho(r, p^\alpha) = \sum_{\substack{1 \le b \le p^\alpha \\ p \nmid b \\ p \nmid b - r}} 1 = \begin{cases} p^\alpha - p^{\alpha-1} & \text{if } p \mid r, \\ p^\alpha - 2p^{\alpha-1} & \text{if } p \nmid r. \end{cases}$$

Then our claim follows from the equations

$$\rho(r,q) = \prod_{\substack{p^\alpha \| q \\ p \mid r}} \left(p^\alpha - p^{\alpha-1}\right) \prod_{\substack{p^\alpha \| q \\ p \nmid r}} \left(p^\alpha - 2p^{\alpha-1}\right) = \varphi(q) \prod_{\substack{p \mid q \\ p \nmid r}} \frac{p-2}{p-1} \tag{19}$$

and

$$\frac{\mathfrak{S}(r)}{\rho(r,q)} = \frac{\mathfrak{S}(r)}{\varphi(q)} \prod_{\substack{p \mid q \\ p \nmid r}} \frac{p-1}{p-2} = \frac{2}{\varphi(q)} \prod_{p \neq 2} \frac{p(p-2)}{(p-1)^2} \prod_{\substack{p \mid r \\ p \neq 2}} \frac{p-1}{p-2} \prod_{\substack{p \mid q \\ p \nmid r}} \frac{p-1}{p-2} = \frac{\mathfrak{S}(rq)}{\varphi(q)}.$$
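The formula for $\rho(r,q)$ can be checked against a direct count. The sketch below is our own illustration with hypothetical function names; it compares the definition with the product $q \prod_{p \mid q,\, p \mid r} (1 - 1/p) \prod_{p \mid q,\, p \nmid r} (1 - 2/p)$, which is another way of writing $\varphi(q) \prod_{p \mid q,\, p \nmid r} (p-2)/(p-1)$.

```python
from fractions import Fraction
from math import gcd

def rho_bruteforce(r, q):
    """#{a mod q : gcd(a, q) = gcd(a - r, q) = 1}, straight from the definition."""
    return sum(1 for a in range(1, q + 1)
               if gcd(a, q) == 1 and gcd(a - r, q) == 1)

def prime_factors(n):
    """Distinct prime factors of n by trial division."""
    factors, d = [], 2
    while d * d <= n:
        if n % d == 0:
            factors.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        factors.append(n)
    return factors

def rho_formula(r, q):
    """q * prod_{p | q} (1 - 1/p if p | r else 1 - 2/p), computed exactly."""
    value = Fraction(q)
    for p in prime_factors(q):
        value *= Fraction(p - 1, p) if r % p == 0 else Fraction(p - 2, p)
    return value

if __name__ == "__main__":
    for r, q in ((2, 15), (6, 15), (4, 12), (3, 8)):
        print(r, q, rho_bruteforce(r, q), rho_formula(r, q))
```

Note that $\rho(r,q) = 0$ exactly when $q$ is even and $r$ is odd, which is consistent with $\mathfrak{S}(r) = 0$ for odd $r$.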

Now let us extend the definition of $\rho(r,q)$ to characters modulo $q$: if $\chi$ is any non-principal character modulo $q$, let

$$\rho(r,\chi) := \sum_{\substack{1 \le b \le q \\ (b - r, q) = 1}} \chi(b) = \sum_{1 \le b \le q} \chi(b)\chi_0(b - r);$$

if $\chi_0$ denotes the principal character modulo $q$, let

$$\rho(r,\chi_0) := \rho(r,q).$$

By the orthogonality of characters, we obtain that

$$\mathfrak{S}(r,q,a) = \frac{\mathfrak{S}(r)}{\rho(r,q)} = \frac{\mathfrak{S}(r)}{\varphi(q)} \sum_{\chi} \bar{\chi}(a)\,\frac{\rho(r,\chi)}{\rho(r,q)}, \tag{20}$$

where this formula also incorporates all the conditions that $2 \mid r$ and $(a,q) = (a - r, q) = 1$. This simple representation plays a crucial role in the following computation.

Let $\varepsilon, M > 0$, $N > N(M)$, $x > x(\varepsilon, M)$, $x^{1/3+\varepsilon} \le R \le x$, $Q \le x \log^{-N} x$ and $0 < X < X + Y \le x$ be fixed, as in the statement of Theorem 3. We define, for any (even) integer $r$ and character $\chi$,

$$F(r,\chi) := \sum_{\substack{X < p \le X+Y \\ p - p' = r}} \chi(p) \log p \cdot \log p'.$$

Then, from the orthogonality of characters we obtain that

$$\sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q) \\ p - p' = r}} \log p \cdot \log p' = \frac{1}{\varphi(q)} \sum_{\chi} \bar{\chi}(a) F(r,\chi).$$

Usually, the main term comes from the principal character and the contribution of the rest is small due to the oscillation of the characters. Unfortunately, our situation above is more complex and we need to compute a dispersion over all characters, as follows.

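The left hand side of Theorem 3 can be transformed (using (20) and orthogonality) into:

$$\begin{aligned} S &:= \sum_{0 < r \le R} \sum_{q \le Q} \sum_{1 \le a \le q} \Bigg| \sum_{\substack{X < p \le X+Y \\ p \equiv a \,(\mathrm{mod}\, q) \\ p - p' = r}} \log p \cdot \log p' - \mathfrak{S}(r,q,a)\,Y \Bigg|^2 \\ &= \sum_{0 < r \le R} \sum_{q \le Q} \sum_{1 \le a \le q} \Bigg| \frac{1}{\varphi(q)} \sum_{\chi} \bar{\chi}(a) \left(F(r,\chi) - \frac{\mathfrak{S}(r)\rho(r,\chi)Y}{\rho(r,q)}\right) \Bigg|^2 \\ &= \sum_{0 < r \le R} \sum_{q \le Q} \frac{1}{\varphi(q)} \sum_{\chi} \left| F(r,\chi) - \frac{\mathfrak{S}(r)\rho(r,\chi)Y}{\rho(r,q)} \right|^2 \\ &= \sum_{0 < r \le R} \sum_{q \le Q} \frac{1}{\varphi(q)} \sum_{\chi} |F(r,\chi)|^2 - 2 \sum_{0 < r \le R} \sum_{q \le Q} \frac{\mathfrak{S}(r)Y}{\varphi(q)\rho(r,q)} \sum_{\chi} \mathrm{Re}\left(F(r,\chi)\overline{\rho(r,\chi)}\right) \\ &\quad + \sum_{0 < r \le R} \sum_{q \le Q} \frac{1}{\varphi(q)} \sum_{\chi} \frac{\mathfrak{S}(r)^2 |\rho(r,\chi)|^2 Y^2}{\rho(r,q)^2}. \end{aligned}$$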

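By the orthogonality of characters,

$$\frac{1}{\varphi(q)} \sum_{\chi} |\rho(r,\chi)|^2 = \sum_{\substack{1 \le b \le q \\ (b - r, q) = 1}}\ \sum_{\substack{1 \le c \le q \\ (c - r, q) = 1}} \frac{1}{\varphi(q)} \sum_{\chi} \chi(b)\bar{\chi}(c) = \rho(r,q),$$

and then the last term in $S$ simplifies to

$$Y^2 \sum_{0 < r \le R} \sum_{q \le Q} \frac{\mathfrak{S}(r)^2}{\rho(r,q)}.$$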

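More importantly,

$$\begin{aligned} \frac{1}{\varphi(q)} \sum_{\chi} F(r,\chi)\overline{\rho(r,\chi)} &= \sum_{\substack{X < p \le X+Y \\ p - p' = r}} \log p \cdot \log p'\ \frac{1}{\varphi(q)} \sum_{\chi} \chi(p) \sum_{\substack{1 \le b \le q \\ (b - r, q) = 1}} \bar{\chi}(b) \\ &= \sum_{\substack{X < p \le X+Y \\ p - p' = r \\ (pp', q) = 1}} \log p \cdot \log p' = \sum_{\substack{X < p \le X+Y \\ p - p' = r}} \log p \cdot \log p' + O\left(\log^2 x\right). \end{aligned}$$

The expected asymptotic for this last sum is $\mathfrak{S}(r)Y$, which is indeed true on average over $r$ from the case of $a = q = 1$ of Proposition 9.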

Also, from $\mathfrak{S}(rq) \ll \mathfrak{S}(r)\mathfrak{S}(q)$ and from the fact that $\mathfrak{S}(r)$ is 1 on average, that is,

$$\sum_{r \le R} \mathfrak{S}(r) = O(R)$$

and



we deduce that 



r<R 



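$$\sum_{0 < r \le R} \sum_{q \le Q} \frac{\mathfrak{S}(r)Y}{\varphi(q)\rho(r,q)} \sum_{\chi} \mathrm{Re}\left(F(r,\chi)\overline{\rho(r,\chi)}\right) = Y^2 \sum_{0 < r \le R} \sum_{q \le Q} \frac{\mathfrak{S}(r)^2}{\rho(r,q)} + O\left(\frac{R x^2}{\log^M x}\right).$$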

Putting everything together, we obtain that

$$S = \sum_{0 < r \le R} \sum_{q \le Q} \frac{1}{\varphi(q)} \sum_{\chi} |F(r,\chi)|^2 - Y^2 \sum_{0 < r \le R} \sum_{q \le Q} \frac{\mathfrak{S}(r)^2}{\rho(r,q)} + O\left(\frac{R x^2}{\log^M x}\right).$$

Note that nothing deep has happened so far besides the one application of Proposition 9; we have utilized only the basic properties of Dirichlet characters. Now we need to show that the first and the second terms are asymptotically equal, that is, we need to be exact in our computation.

As a first step we define

$$C(f,Q) := \sum_{\substack{q \le Q \\ f \mid q}} \frac{1}{\varphi(q)}, \tag{21}$$

which satisfies

$$C(f,Q) \ll \frac{1}{\varphi(f)} \log Q. \tag{22}$$

We also observe that, if $\chi$ mod $q$ is induced by the primitive character $\chi^*$ mod $f$, then, due to the fact that $F(r,\chi)$ is a sum over primes, we have

$$F(r,\chi) = F(r,\chi^*) + O\left(\log^2 x\right).$$

By rearranging the first sum in $S$ according to primitive characters and using the above, we then see that

$$S = \sum_{0 < r \le R} \sum_{f \le Q} C(f,Q) \sideset{}{^*}\sum_{\chi} |F(r,\chi)|^2 - Y^2 \sum_{0 < r \le R} \sum_{q \le Q} \frac{\mathfrak{S}(r)^2}{\rho(r,q)} + O\left(\frac{R x^2}{\log^M x}\right),$$

where $\sum^*$ is a sum over all primitive characters modulo $f$.

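Now for any fixed (even) $0 < r \le R$, we use the large sieve inequality to estimate

$$\sum_{Q_0 < f \le Q} C(f,Q) \sideset{}{^*}\sum_{\chi} |F(r,\chi)|^2 \ll \left(\frac{x^2}{Q_0} + Qx\right) \log^3 x \ll \frac{x^2}{\log^M x},$$

where $Q_0 := \log^{M+3} x$ and $N > M + 3$. This implies that

$$S = \sum_{0 < r \le R} \sum_{f \le Q_0} C(f,Q) \sideset{}{^*}\sum_{\chi} |F(r,\chi)|^2 - Y^2 \sum_{0 < r \le R} \sum_{q \le Q} \frac{\mathfrak{S}(r)^2}{\rho(r,q)} + O\left(\frac{R x^2}{\log^M x}\right).$$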

Using the notation

$$\psi(X,Y; r, f, b) := \sum_{\substack{X < p \le X+Y \\ p \equiv b \,(\mathrm{mod}\, f) \\ p - p' = r}} \log p \cdot \log p', \tag{23}$$

$$E(X,Y; r, f, b) := \psi(X,Y; r, f, b) - \mathfrak{S}(r,f,b)\,Y, \tag{24}$$

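we see that

\begin{align*}
F(r,\chi) &= \sum_{1 \le b \le f} \chi(b)\,\psi(X,Y;r,f,b) \\
&= \sum_{\substack{1 \le b \le f \\ (b-r,f)=1}} \chi(b)\,\mathfrak{S}(r,f,b)\,Y + \sum_{1 \le b \le f} \chi(b)\,E(X,Y;r,f,b) \\
&= \frac{\mathfrak{S}(r)\,Y}{\rho(r,f)}\,\rho(r,\chi) + O\Bigl(f \max_{(b,f)=1} |E(X,Y;r,f,b)|\Bigr).
\end{align*}

For any small $f$ and $b$, the sum of $|E(X,Y;r,f,b)|$ over $r$ is sufficiently small by Proposition 9;
consequently, the same is also true for the sum over $f$.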

Additionally, we can evaluate $\rho(r,\chi)$ for a primitive character. It is well-known that for a
primitive character $\chi$ modulo $f$ and for all $d \mid f$, $d \ne f$, we have $\sum_{1 \le c \le f/d} \chi(r + cd) = 0$ (see
[Da, Chapter 9]). Using this, we quickly infer that

\[ \rho(r,\chi) = \sum_{1 \le b \le f} \chi(b) \sum_{d \mid (b-r,f)} \mu(d) = \sum_{d \mid f} \mu(d) \sum_{\substack{1 \le b \le f \\ b \equiv r \pmod{d}}} \chi(b) = \mu(f)\chi(r). \]
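The identity $\rho(r,\chi) = \mu(f)\chi(r)$ can be sanity-checked numerically. The following is a minimal sketch, not part of the original argument. It assumes that $\rho(r,\chi)$ is the sum of $\chi(b)$ over $1 \le b \le f$ with $(b-r,f)=1$, as the display above suggests. It only tests the real primitive characters given by the Jacobi symbol modulo an odd square-free $f$. The helper names `jacobi`, `mobius` and `rho` are illustrative.

```python
from math import gcd

def jacobi(a: int, n: int) -> int:
    """Jacobi symbol (a/n) for odd n >= 1 (standard reciprocity algorithm)."""
    a %= n
    result = 1
    while a != 0:
        while a % 2 == 0:
            a //= 2
            if n % 8 in (3, 5):
                result = -result
        a, n = n, a
        if a % 4 == 3 and n % 4 == 3:
            result = -result
        a %= n
    return result if n == 1 else 0

def mobius(n: int) -> int:
    """Moebius function by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def rho(r: int, f: int) -> int:
    """Assumed definition: sum of chi(b) over 1 <= b <= f with (b - r, f) = 1,
    where chi is the Jacobi symbol modulo f (primitive for odd square-free f)."""
    return sum(jacobi(b, f) for b in range(1, f + 1) if gcd(b - r, f) == 1)

if __name__ == "__main__":
    for f in (3, 5, 7, 15, 21, 35, 105):
        ok = all(rho(r, f) == mobius(f) * jacobi(r, f) for r in range(-f, 2 * f))
        print(f, ok)
```

The check covers only one real character per modulus; complex primitive characters satisfy the same identity by the argument in the text but are not exercised here.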

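Putting everything together, we arrive at the equation

\[ S = Y^2 \sum_{0 < r \le R} \sum_{\substack{f \le Q_0 \\ (r,f)=1 \\ f \text{ is square-free}}} C(f,Q)\, \frac{\mathfrak{S}(r)^2}{\rho(r,f)^2} \sum_{\chi}{}^{*}\, 1 \;-\; Y^2 \sum_{0 < r \le R} \sum_{q \le Q} \frac{\mathfrak{S}(r)^2}{\rho(r,q)} + O\left(\frac{R x^2}{\log^M x}\right). \]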

Let us denote the number of primitive characters modulo $f$ by $\varphi^*(f)$. We note that this is
a multiplicative function for which $\varphi^*(p) = p - 2$. The sum over $f$ is a quickly converging sum
by (19) and (22), so we can drop the condition $f \le Q_0$ for a price already paid by the error
term. Writing back the definition of $C(f,Q)$, we obtain, after a little rearrangement, that
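As an illustration of the two stated properties of $\varphi^*$, here is a minimal sketch. It uses the standard Möbius-inversion formula $\varphi^*(n) = \sum_{d \mid n} \mu(d)\varphi(n/d)$, which follows from $\varphi(n) = \sum_{d \mid n} \varphi^*(d)$. The function names are illustrative and not taken from the paper.

```python
from math import gcd

def phi(n: int) -> int:
    """Euler's totient by direct counting (fine for small n)."""
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)

def mobius(n: int) -> int:
    """Moebius function by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def phi_star(n: int) -> int:
    """Number of primitive Dirichlet characters modulo n."""
    return sum(mobius(d) * phi(n // d) for d in range(1, n + 1) if n % d == 0)

if __name__ == "__main__":
    for p in (3, 5, 7, 11, 13):
        print(p, phi_star(p))            # equals p - 2
    print(phi_star(15), phi_star(3) * phi_star(5))  # multiplicativity on coprime moduli
```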

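\[ S = Y^2 \sum_{0 < r \le R} \mathfrak{S}(r)^2 \sum_{q \le Q} \frac{1}{\varphi(q)} \left( \sum_{\substack{f \mid q \\ (r,f)=1 \\ f \text{ is square-free}}} \frac{\varphi^*(f)}{\rho(r,f)^2} - \frac{\varphi(q)}{\rho(r,q)} \right) + O\left(\frac{R x^2}{\log^M x}\right). \]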



Finally, let us notice that by (19) we actually have 0 inside the big parentheses, as everything
is multiplicative and

[r,f)=l p\r p\r 

f is square-free 

This completes the proof of Theorem 3. □
6 Proof of Proposition 8

This section consists of a proof of Proposition 8. This is done in two parts: an estimate of
the error term in Proposition 8, which relies on Theorem 3, and an estimate of the main term,
which consists mainly in the computation of the constant $\mathfrak{C}$. Note that in the course of proving
Proposition 8 we can always assume $R > x^{1/3+\varepsilon}$ and $Y > \sqrt{X}$, as otherwise the error term is
an obvious upper bound for all the other terms in (10). Note also that the term $r = 1$ behaves
differently, as $p + 1 - r$ is always prime in this case. However, any trivial bound (obtained by
dropping the primality of $p$, or by bounding the character by 1) shows that this term is much
smaller than the error term in Proposition 8; therefore it can comfortably be excluded from any
further investigation.

Note that, using notation (24), Theorem 3 can be formulated as

\[ \sum_{0 < |r| \le R} \sum_{q \le Q} \sum_{a \pmod{q}} |E(X,Y;r,q,a)|^2 \ll \frac{R x^2}{\log^M x} \]

whenever $x^{1/3+\varepsilon} \le R \le x$, $Q \le x \log^{-N} x$, and $X + Y \le x$.

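Using the same notation, as well as (23), we rewrite the left hand side of (10) as

\begin{align}
&\sum_{\substack{|r| \le R,\ r \ne 1 \\ f \le V \\ n \le U}} \frac{1}{nf} \sum_{a \pmod{4n}} \Bigl(\frac{a}{n}\Bigr)\, \psi\bigl(X,Y;r-1,nf^2,(r^2-af^2)/4\bigr) \tag{25} \\
&\quad = Y \sum_{\substack{|r| \le R,\ r \ne 1 \\ f \le V \\ n \le U}} \frac{1}{nf} \sum_{a \pmod{4n}} \Bigl(\frac{a}{n}\Bigr)\, \mathfrak{S}\bigl(r-1,nf^2,(r^2-af^2)/4\bigr) \tag{26} \\
&\qquad + \sum_{\substack{|r| \le R,\ r \ne 1 \\ f \le V \\ n \le U}} \frac{1}{nf} \sum_{a \pmod{4n}} \Bigl(\frac{a}{n}\Bigr)\, E\bigl(X,Y;r-1,nf^2,(r^2-af^2)/4\bigr). \tag{27}
\end{align}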



6.1 Estimate of the error term in Proposition 8

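In what follows, we will show how Theorem 3 allows us to control the error term (27). First,
using the Cauchy-Schwarz inequality, we obtain

\begin{align}
&\Biggl| \sum_{\substack{|r| \le R,\ r \ne 1 \\ f \le V \\ n \le U}} \frac{1}{nf} \sum_{a \pmod{4n}} \Bigl(\frac{a}{n}\Bigr)\, E\bigl(X,Y;r-1,nf^2,(r^2-af^2)/4\bigr) \Biggr| \notag \\
&\quad \le \sum_{f \le V} \frac{1}{f} \Biggl( \sum_{\substack{|r| \le R \\ n \le U \\ a \pmod{4n}}} \frac{1}{n^2} \Biggr)^{1/2} \Biggl( \sum_{\substack{|r| \le R,\ r \ne 1 \\ n \le U \\ a \pmod{4n}}} E^2\bigl(X,Y;r-1,nf^2,(r^2-af^2)/4\bigr) \Biggr)^{1/2}. \tag{28}
\end{align}

The first inner sum above is estimated trivially as

\[ \Biggl( \sum_{\substack{|r| \le R \\ n \le U \\ a \pmod{4n}}} \frac{1}{n^2} \Biggr)^{1/2} \ll R^{1/2} \log^{1/2} U. \tag{29} \]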
For the second inner sum we observe that

\[ \sum_{\substack{|r| \le R,\ r \ne 1 \\ n \le U \\ a \pmod{4n}}} E^2\bigl(X,Y;r-1,nf^2,(r^2-af^2)/4\bigr) \le \sum_{|r| \le R} \sum_{q \le 4Uf^2} \sum_{b \pmod{q}} E^2(X,Y;r-1,q,b), \]

as for each fixed $f, r, n$, the residue classes

\[ \frac{r^2 - af^2}{4}, \qquad a \pmod{4n}, \]

cover each residue class modulo $4n$ at most once. Then, using Theorem 3, we obtain

\[ \sum_{\substack{|r| \le R,\ r \ne 1 \\ n \le U \\ a \pmod{4n}}} E^2\bigl(X,Y;r-1,nf^2,(r^2-af^2)/4\bigr) \ll \frac{R x^2}{\log^M x} \tag{30} \]

for any $M > 0$, provided that

\[ 4UV^2 \le x \log^{-N} x. \tag{31} \]
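Using the estimates (29) and (30) in (28), we finally obtain that

\begin{align}
\sum_{\substack{|r| \le R,\ r \ne 1 \\ f \le V \\ n \le U}} \frac{1}{nf} \sum_{a \pmod{4n}} \Bigl(\frac{a}{n}\Bigr)\, E\bigl(X,Y;r-1,nf^2,(r^2-af^2)/4\bigr)
&\ll \frac{R x \log^{1/2} U}{\log^{M/2} x} \sum_{f \le V} \frac{1}{f} \notag \\
&\ll \frac{R x}{\log^{(M-3)/2} x} \tag{32}
\end{align}

for any $M > 0$. This estimates the error term (27), and thus the error term of Proposition 8
(after renaming $M$).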
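For the reader's convenience, the exponent bookkeeping behind (32) can be spelled out as follows. This is a sketch under the assumption, not restated in this passage, that $U, V \le x$, so that $\log U$ and $\log V$ are at most $\log x$.

```latex
\[
\sum_{f \le V} \frac{1}{f}
\cdot \bigl( R \log U \bigr)^{1/2}
\cdot \Bigl( \frac{R x^2}{\log^{M} x} \Bigr)^{1/2}
\;\ll\;
\log V \cdot \log^{1/2} U \cdot \frac{R x}{\log^{M/2} x}
\;\ll\;
\frac{R x \, \log^{3/2} x}{\log^{M/2} x}
\;=\;
\frac{R x}{\log^{(M-3)/2} x}.
\]
```

Since $M > 0$ is arbitrary, replacing $M$ by $2M + 3$ turns the right hand side into $Rx/\log^{M} x$, which is the renaming referred to above.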

6.2 Computation of the constant in Proposition 8



We now treat the main term (26) in (25), which is essentially a computation of the constant
$\mathfrak{C}$ in Theorem 1. We first analyse the sum over $n$ and $f$ of (26) when $r$ is a fixed integer. We
remark that the sum is zero if $r$ is even, thus we can assume that $r$ is odd. We also take $r \ne 1$.

Our goal in this section is to prove: 

Proposition 10 Let $r \ne 1$ be an odd integer. Then,



JL E (£)6 M »f,^. ; .),«) = C , + o(^^) 



f< v J a(mod4n) 



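where $C_r$ is the positive constant

\begin{align*}
C_r &:= \sum_{f=1}^{\infty} \sum_{n=1}^{\infty} \frac{1}{nf} \sum_{a \pmod{4n}} \Bigl(\frac{a}{n}\Bigr)\, \mathfrak{S}\bigl(r-1,nf^2,(r^2-af^2)/4\bigr) \\
&= \frac{4}{3} \prod_{\ell \ne 2} \frac{\ell^2(\ell^2 - 2\ell - 2)}{(\ell-1)^3(\ell+1)} \prod_{\substack{\ell \mid r-1 \\ \ell \ne 2}} \left(1 + \frac{\ell+1}{\ell^2 - 2\ell - 2}\right) \prod_{\ell \mid r(r-2)} \left(1 + \frac{1}{\ell^2 - 2\ell - 2}\right).
\end{align*}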
Proof of Proposition 10. Using the definition of $\mathfrak{S}(\cdot,\cdot,\cdot)$, we rewrite the left hand side of
the desired equation in Proposition 10 as

\[ \left( 2 \prod_{\ell \ne 2} \frac{\ell(\ell-2)}{(\ell-1)^2} \right) \sum_{\substack{f \le V \\ f \text{ odd}}} \sum_{n \le U} \frac{1}{n f \varphi(nf^2)} \prod_{\substack{\ell \mid nf^2(r-1) \\ \ell \ne 2}} \frac{\ell-1}{\ell-2}\; c_f^r(n), \tag{33} \]

where

\[ c_f^r(n) := \sum_{a \pmod{4n}}{}^{\prime}\ \Bigl(\frac{a}{n}\Bigr) \]

and $\sum'$ indicates that the sum is taken over the invertible residues $a$ modulo $4n$ such that

\[ \bigl((r^2 - af^2)/4,\ nf^2\bigr) = 1 \quad\text{and}\quad \bigl((r^2 - af^2)/4 - (r-1),\ nf^2\bigr) = 1, \]

that is,

\[ (r^2 - af^2,\ 4nf^2) = 4 \quad\text{and}\quad (r^2 - af^2 - 4(r-1),\ 4nf^2) = 4. \]

As $r, f$ are odd and $(r,f)$ divides $(r^2 - af^2, 4nf^2)$, we must have that $(r,f) = 1$; in this case,

\[ (r^2 - af^2,\ 4nf^2) = 4 \iff (r^2 - af^2,\ 4n) = 4. \]

Similarly, as $(r-2,f)$ divides $(r^2 - af^2 - 4(r-1), nf^2) = ((r-2)^2 - af^2, nf^2)$, we must have
$(r-2,f) = 1$; in this case,

\[ ((r-2)^2 - af^2,\ 4nf^2) = 4 \iff ((r-2)^2 - af^2,\ 4n) = 4. \]

Then

\[ c_f^r(n) = \begin{cases} \displaystyle \sum_{\substack{a\,(4n)^* \\ (r^2 - af^2,\,4n) = 4 \\ ((r-2)^2 - af^2,\,4n) = 4}} \Bigl(\frac{a}{n}\Bigr) & \text{if } (r,f) = (r-2,f) = 1, \\[2ex] 0 & \text{otherwise,} \end{cases} \]

where $a\,(4n)^*$ denotes invertible residue classes $a$ modulo $4n$.

To continue the proof, we need some properties of the function $c_f^r(n)$:

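Lemma 11 Let $r \ne 1$ be an odd integer and let $f$ be a positive odd integer such that $(r,f) =
(r-2,f) = 1$. Let $c_f^r(n)$ be as defined above. The following statements hold:

1. if $n$ is odd, then

\[ c_f^r(n) = \sum_{\substack{a \pmod{n}^* \\ (r^2 - af^2,\,n) = 1 \\ ((r-2)^2 - af^2,\,n) = 1}} \Bigl(\frac{a}{n}\Bigr), \]

where $a \pmod{n}^*$ denotes invertible residue classes $a$ modulo $n$;

2. $c_f^r(n)$ is a multiplicative function of $n$;

3. if $\ell$ is an odd prime and $(\ell,f) = 1$, then

\[ \frac{c_f^r(\ell^\alpha)}{\ell^{\alpha-1}} = \begin{cases} \ell - 2 & \text{if } \alpha \text{ is even and } \ell \mid r(r-2)(r-1), \\ \ell - 3 & \text{if } \alpha \text{ is even and } \ell \nmid r(r-2)(r-1), \\ -1 & \text{if } \alpha \text{ is odd and } \ell \mid r(r-2)(r-1), \\ -2 & \text{if } \alpha \text{ is odd and } \ell \nmid r(r-2)(r-1); \end{cases} \]

4. if $\ell$ is an odd prime and $\ell \mid f$ (which implies that $(\ell,r) = (\ell,r-2) = 1$ by the hypotheses
on $f$), then

\[ \frac{c_f^r(\ell^\alpha)}{\ell^{\alpha-1}} = \begin{cases} 0 & \text{if } \alpha \text{ is odd,} \\ \ell - 1 & \text{if } \alpha \text{ is even;} \end{cases} \]

5. \[ \frac{c_f^r(2^\alpha)}{2^{\alpha-1}} = (-1)^\alpha. \]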
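Parts 1 and 2 of the lemma lend themselves to a brute-force check. The sketch below is illustrative only. It assumes that $(a/n)$ is the Kronecker symbol and that $(r,f) = (r-2,f) = 1$, so that the gcd conditions reduce to the modulus $4n$ as explained before the lemma. The function names `jacobi`, `kronecker`, `c` and `c_odd` are not from the paper.

```python
from math import gcd

def jacobi(a: int, n: int) -> int:
    """Jacobi symbol (a/n) for odd n >= 1."""
    a %= n
    result = 1
    while a != 0:
        while a % 2 == 0:
            a //= 2
            if n % 8 in (3, 5):
                result = -result
        a, n = n, a
        if a % 4 == 3 and n % 4 == 3:
            result = -result
        a %= n
    return result if n == 1 else 0

def kronecker(a: int, n: int) -> int:
    """Kronecker symbol (a/n) for n >= 1, extending Jacobi to even n."""
    sign = 1
    while n % 2 == 0:
        n //= 2
        if a % 2 == 0:
            return 0
        if a % 8 in (3, 5):
            sign = -sign
    return sign * jacobi(a, n)

def c(r: int, f: int, n: int) -> int:
    """c_f^r(n): sum of (a/n) over invertible a mod 4n with both gcd conditions equal to 4."""
    m = 4 * n
    return sum(kronecker(a, n) for a in range(1, m + 1)
               if gcd(a, m) == 1
               and gcd(r * r - a * f * f, m) == 4
               and gcd((r - 2) ** 2 - a * f * f, m) == 4)

def c_odd(r: int, f: int, n: int) -> int:
    """Right hand side of part 1 (n odd): the same sum taken modulo n."""
    return sum(jacobi(a, n) for a in range(1, n + 1)
               if gcd(a, n) == 1
               and gcd(r * r - a * f * f, n) == 1
               and gcd((r - 2) ** 2 - a * f * f, n) == 1)

if __name__ == "__main__":
    print([c(3, 1, n) == c_odd(3, 1, n) for n in (3, 5, 7, 9, 15, 21)])   # part 1
    print(c(3, 1, 15) == c(3, 1, 3) * c(3, 1, 5))                        # part 2
    print(c(3, 1, 6) == c(3, 1, 2) * c(3, 1, 3))                         # part 2, mixed parity
```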



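Proof.

1. If $n$ is odd, then

\[ c_f^r(n) = \sum_{\substack{a \pmod{4n}^*,\ a \equiv 1 \pmod{4} \\ (r^2 - af^2,\,n) = 1 \\ ((r-2)^2 - af^2,\,n) = 1}} \Bigl(\frac{a}{n}\Bigr) = \sum_{\substack{a \pmod{n}^* \\ (r^2 - af^2,\,n) = 1 \\ ((r-2)^2 - af^2,\,n) = 1}} \Bigl(\frac{a}{n}\Bigr), \]

where the last equality follows from the Chinese Remainder Theorem and the fact that
$\bigl(\frac{a_1}{n}\bigr) = \bigl(\frac{a_2}{n}\bigr)$ when $a_1 \equiv a_2 \pmod{n}$ for $n$ odd.

2. Let $n_1, n_2$ be two co-prime positive integers with $n_1$ odd, and let $n = n_1 n_2$. Then, using
the Chinese Remainder Theorem, we obtain

\begin{align*}
c_f^r(n_1)\, c_f^r(n_2) &= \sum_{\substack{a_1 \pmod{n_1}^* \\ (r^2 - a_1 f^2,\,n_1) = 1 \\ ((r-2)^2 - a_1 f^2,\,n_1) = 1}} \Bigl(\frac{a_1}{n_1}\Bigr) \sum_{\substack{a_2 \pmod{4n_2}^* \\ (r^2 - a_2 f^2,\,4n_2) = 4 \\ ((r-2)^2 - a_2 f^2,\,4n_2) = 4}} \Bigl(\frac{a_2}{n_2}\Bigr) \\
&= \sum_{\substack{a \pmod{4n_1 n_2}^* \\ (r^2 - af^2,\,4n_1 n_2) = 4 \\ ((r-2)^2 - af^2,\,4n_1 n_2) = 4}} \Bigl(\frac{a}{n_1 n_2}\Bigr) = c_f^r(n_1 n_2).
\end{align*}

3. We have that

\begin{equation}
\frac{c_f^r(\ell^\alpha)}{\ell^{\alpha-1}} = \sum_{\substack{a \pmod{\ell}^* \\ (r^2 - af^2,\,\ell) = 1 \\ ((r-2)^2 - af^2,\,\ell) = 1}} \Bigl(\frac{a}{\ell}\Bigr)^{\alpha} = \sum_{a \pmod{\ell}^*} \Bigl(\frac{a}{\ell}\Bigr)^{\alpha} - \sum_{\substack{a \pmod{\ell}^* \\ a \equiv \bar{f}^2 r^2 \pmod{\ell} \text{ or} \\ a \equiv \bar{f}^2 (r-2)^2 \pmod{\ell}}} \Bigl(\frac{a}{\ell}\Bigr)^{\alpha}, \tag{34}
\end{equation}

where $\bar{f}$ denotes the inverse of $f$ modulo $\ell$. We then need to count the number of invertible
residues $a$ modulo $\ell$ which are eliminated by the two congruence conditions of the second
sum. If $r \equiv 0, 1, 2 \pmod{\ell}$, there is exactly one such residue (notice that $r^2 \equiv (r-2)^2 \pmod{\ell}
\iff r \equiv 1 \pmod{\ell}$). In all three cases, this residue is an invertible square
modulo $\ell$, and the second sum on the right hand side of (34) has value $+1$. The result
follows immediately when $\alpha$ is even, and follows from the orthogonality relations when $\alpha$
is odd. If $r \not\equiv 0, 1, 2 \pmod{\ell}$, there are two invertible residues which are eliminated by the
two congruence conditions on $a$, and the second sum on the right hand side of (34) has
value $+2$. The result follows as above.

4. We have that

\[ \frac{c_f^r(\ell^\alpha)}{\ell^{\alpha-1}} = \sum_{\substack{a \pmod{\ell}^* \\ (r^2 - af^2,\,\ell) = 1 \\ ((r-2)^2 - af^2,\,\ell) = 1}} \Bigl(\frac{a}{\ell}\Bigr)^{\alpha} = \sum_{a \pmod{\ell}^*} \Bigl(\frac{a}{\ell}\Bigr)^{\alpha}, \]

since $(r^2 - af^2, \ell) = ((r-2)^2 - af^2, \ell) = 1$ for all $a$ when $\ell \mid f$ and $(r,f) = (r-2,f) = 1$.
The result follows immediately when $\alpha$ is even, and using the orthogonality relations when
$\alpha$ is odd.

5. Let $\alpha \ge 1$. Since $\bigl(\frac{\cdot}{2}\bigr)$ is a character modulo 8, we write

\[ \frac{c_f^r(2^\alpha)}{2^{\alpha-1}} = \sum_{\substack{a \pmod{8}^* \\ (r^2 - af^2,\,2^{\alpha+2}) = 4 \\ ((r-2)^2 - af^2,\,2^{\alpha+2}) = 4}} \Bigl(\frac{a}{2}\Bigr)^{\alpha} = \Bigl(\frac{5}{2}\Bigr)^{\alpha} = (-1)^{\alpha}, \]

as $r^2 \equiv (r-2)^2 \equiv f^2 \equiv 1 \pmod{8}$ for $r, f$ odd, so that only $a \equiv 5 \pmod{8}$ satisfies both conditions.

□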
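The values in parts 3, 4 and 5 of Lemma 11 can be confirmed on small cases by direct enumeration. The following sketch is a numerical illustration, not a substitute for the proof. It assumes that $(a/n)$ is the Kronecker symbol and uses the reduced gcd conditions modulo $4n$. The sample pairs $(r,f) = (3,1)$ and $(5,7)$ and all function names are chosen here for illustration.

```python
from math import gcd

def jacobi(a: int, n: int) -> int:
    """Jacobi symbol (a/n) for odd n >= 1."""
    a %= n
    result = 1
    while a != 0:
        while a % 2 == 0:
            a //= 2
            if n % 8 in (3, 5):
                result = -result
        a, n = n, a
        if a % 4 == 3 and n % 4 == 3:
            result = -result
        a %= n
    return result if n == 1 else 0

def kronecker(a: int, n: int) -> int:
    """Kronecker symbol (a/n) for n >= 1."""
    sign = 1
    while n % 2 == 0:
        n //= 2
        if a % 2 == 0:
            return 0
        if a % 8 in (3, 5):
            sign = -sign
    return sign * jacobi(a, n)

def c(r: int, f: int, n: int) -> int:
    """c_f^r(n) by enumeration over invertible residues a modulo 4n."""
    m = 4 * n
    return sum(kronecker(a, n) for a in range(1, m + 1)
               if gcd(a, m) == 1
               and gcd(r * r - a * f * f, m) == 4
               and gcd((r - 2) ** 2 - a * f * f, m) == 4)

def normalized(r: int, f: int, ell: int, alpha: int) -> int:
    """c_f^r(ell^alpha) / ell^(alpha - 1), the quantity tabulated in the lemma."""
    value = c(r, f, ell ** alpha)
    assert value % ell ** (alpha - 1) == 0
    return value // ell ** (alpha - 1)

if __name__ == "__main__":
    # Part 3 with r = 3, f = 1: ell = 3 divides r(r-2)(r-1) = 6, ell = 5 does not.
    print(normalized(3, 1, 3, 1), normalized(3, 1, 3, 2))   # expect -1 and ell - 2
    print(normalized(3, 1, 5, 1), normalized(3, 1, 5, 2))   # expect -2 and ell - 3
    # Part 4 with r = 5, f = 7 and ell = 7 dividing f.
    print(normalized(5, 7, 7, 1), normalized(5, 7, 7, 2))   # expect 0 and ell - 1
    # Part 5 at the prime 2.
    print([normalized(3, 1, 2, k) for k in (1, 2, 3)])      # expect (-1)^k
```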



Using parts 3 and 4 of Lemma 11, we can write

/ 



EE 

f<V n <U 

f odd 



1 



nf4>{np 



n 



i 



c}(n) 



yt\nf2(r- i 



f^2 



E E 



1 



n 



/=i n=l J t \ J / i ^ n/2(r _ 1) 

(2,/) = (r,/) = (r-2,/) = l \ e ^ 2 



= :I? r + 0(i f + 
as in [DaPa]. Note that

\[ C_r = 2 \prod_{\ell \ne 2} \frac{\ell(\ell-2)}{(\ell-1)^2}\; D_r. \]



The rest of this section consists of writing D r as an Euler product. We first write the sum 
over n as a product. In order to have multiplicative functions of n, we use the formulas 

0(n)<A(/ 2 )(n,/ 2 ) 



n 



<?|n/ 2 (r-l) 
1^2 

Now we rewrite D r as 
/ 

oo i 

^ /0(/ 2 ) n 

I / odd « 2 

\(r,/)=(r-2,/)=l 




n 



e\f 2 (r-i) 
e^2 



n 



^K",/ 2 ('--i)) 

^2 



£- 1 



E 



^(n) «/>((/ 2 ,n)) 



^n^(n) (/ 2 ,n) 



/ 



n 

,e^2 



£-1 



1-2 



n 



V <|(n,/2(r-J n 



1^2 



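The sum over $n$ is the sum of a multiplicative function of $n$ whose factors at prime powers
$\ell^\alpha$ depend on the divisibilities of $f$, $r$, $r-1$ and $r-2$ by $\ell$. Using Lemma 11, we can then write
the $n$-sum as

\[ \prod_{\ell \mid f} \sum_{\alpha=0}^{\infty} a_r(\ell^\alpha) \prod_{\ell \nmid f} \sum_{\alpha=0}^{\infty} b_r(\ell^\alpha), \]

where $a_r(1) = b_r(1) = 1$ and for $\ell \ne 2$ and $\alpha \ge 1$,

\[ a_r(\ell^\alpha) = \begin{cases} 0 & \text{if } \alpha \text{ odd,} \\ (\ell-1)/\ell^{\alpha+1} & \text{if } \alpha \text{ even;} \end{cases} \]

\[ b_r(\ell^\alpha) = \begin{cases} -2/\ell^\alpha(\ell-2) & \text{if } \alpha \text{ odd and } \ell \nmid r(r-1)(r-2), \\ (\ell-3)/\ell^\alpha(\ell-2) & \text{if } \alpha \text{ even and } \ell \nmid r(r-1)(r-2), \\ -1/\ell^\alpha(\ell-2) & \text{if } \alpha \text{ odd and } \ell \mid r(r-2), \\ 1/\ell^\alpha & \text{if } \alpha \text{ even and } \ell \mid r(r-2), \\ -1/\ell^\alpha(\ell-1) & \text{if } \alpha \text{ odd and } \ell \mid r-1, \\ (\ell-2)/\ell^\alpha(\ell-1) & \text{if } \alpha \text{ even and } \ell \mid r-1. \end{cases} \]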



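Replacing in $D_r$, this gives

\begin{align*}
D_r &= \prod_{\ell} \sum_{\alpha=0}^{\infty} b_r(\ell^\alpha) \sum_{\substack{f \text{ odd} \\ (r,f)=(r-2,f)=1}} \frac{1}{f\varphi(f^2)} \prod_{\substack{\ell \mid f^2(r-1) \\ \ell \ne 2}} \frac{\ell-1}{\ell-2} \prod_{\ell \mid f} \frac{\sum_{\alpha=0}^{\infty} a_r(\ell^\alpha)}{\sum_{\alpha=0}^{\infty} b_r(\ell^\alpha)} \\
&= \prod_{\ell} \sum_{\alpha=0}^{\infty} b_r(\ell^\alpha) \prod_{\substack{\ell \mid (r-1) \\ \ell \ne 2}} \frac{\ell-1}{\ell-2} \sum_{\substack{f \text{ odd} \\ (r,f)=(r-2,f)=1}} \frac{1}{f\varphi(f^2)} \prod_{\substack{\ell \mid f \\ \ell \nmid r-1}} \frac{\ell-1}{\ell-2} \prod_{\ell \mid f} \frac{\sum_{\alpha=0}^{\infty} a_r(\ell^\alpha)}{\sum_{\alpha=0}^{\infty} b_r(\ell^\alpha)}.
\end{align*}

The sum over $f$ in the last expression is a sum of multiplicative functions of $f$, which we write
as

\[ \prod_{\ell \nmid 2r(r-2)} \sum_{\alpha=0}^{\infty} c_r(\ell^\alpha). \]

Here $c_r(1) = 1$ and for any prime $\ell$ with $\ell \nmid 2r(r-2)$ and any $\alpha \ge 1$, we have

\[ c_r(\ell^\alpha) = \begin{cases} \displaystyle \frac{1}{\ell^\alpha \varphi(\ell^{2\alpha})} \cdot \frac{\ell-1}{\ell-2} \cdot \frac{\sum_{\beta \ge 0} a_r(\ell^\beta)}{\sum_{\beta \ge 0} b_r(\ell^\beta)} & \text{if } \ell \nmid r-1, \\[2ex] \displaystyle \frac{1}{\ell^\alpha \varphi(\ell^{2\alpha})} \cdot \frac{\sum_{\beta \ge 0} a_r(\ell^\beta)}{\sum_{\beta \ge 0} b_r(\ell^\beta)} & \text{if } \ell \mid r-1. \end{cases} \]

Then,

\begin{equation}
D_r = \prod_{\ell} \sum_{\alpha=0}^{\infty} b_r(\ell^\alpha) \prod_{\substack{\ell \mid (r-1) \\ \ell \ne 2}} \frac{\ell-1}{\ell-2} \prod_{\ell \nmid 2r(r-2)} \sum_{\alpha=0}^{\infty} c_r(\ell^\alpha). \tag{35}
\end{equation}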
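We now compute the sums appearing in (35), using the formulas for $a_r(\ell^\alpha)$ and $b_r(\ell^\alpha)$ listed
above:

Lemma 12 Let $\ell$ be an odd prime and $\alpha \ge 1$. Let $a_r(\ell^\alpha)$, $b_r(\ell^\alpha)$ and $c_r(\ell^\alpha)$ be as defined above.
We have:

1. $\displaystyle A(\ell) := \sum_{\alpha=0}^{\infty} a_r(\ell^\alpha) = \frac{\ell^2 + \ell + 1}{\ell(\ell+1)}$;

2. if $\ell \nmid r(r-1)(r-2)$, then $\displaystyle B^{(1)}(\ell) := \sum_{\alpha=0}^{\infty} b_r(\ell^\alpha) = \frac{\ell^3 - 2\ell^2 - 2\ell - 1}{(\ell-2)(\ell^2-1)}$;

3. if $\ell \mid r-1$, then $\displaystyle B^{(2)}(\ell) := \sum_{\alpha=0}^{\infty} b_r(\ell^\alpha) = \frac{\ell^3 - \ell^2 - \ell - 1}{(\ell-1)^2(\ell+1)}$;

4. if $\ell \nmid r-1$, but $\ell \mid r(r-2)$, then $\displaystyle B^{(3)}(\ell) := \sum_{\alpha=0}^{\infty} b_r(\ell^\alpha) = \frac{\ell(\ell^2 - 2\ell - 1)}{(\ell-2)(\ell^2-1)}$;

5. if $\ell \nmid 2r(r-2)(r-1)$, then $\displaystyle C^{(1)}(\ell) := \sum_{\alpha=0}^{\infty} c_r(\ell^\alpha) = 1 + \frac{1}{\ell^3 - 2\ell^2 - 2\ell - 1}$;

6. if $\ell \mid r-1$, then $\displaystyle C^{(2)}(\ell) := \sum_{\alpha=0}^{\infty} c_r(\ell^\alpha) = 1 + \frac{1}{\ell^3 - \ell^2 - \ell - 1}$;

7. if $\ell = 2$, then $\displaystyle B(2) := \sum_{\alpha=0}^{\infty} b_r(2^\alpha) = \frac{2}{3}$.

Proof. All the computations are straightforward, following in one line from the formula for the
sum of the geometric series. □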
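The geometric-series computations behind $A(\ell)$ and $B^{(1)}(\ell), B^{(2)}(\ell), B^{(3)}(\ell)$ can be replayed in exact rational arithmetic. The sketch below takes the values of $a_r(\ell^\alpha)$ and $b_r(\ell^\alpha)$ as input, with $a_r(\ell^\alpha) = 0$ for odd $\alpha$ (an assumption consistent with part 4 of Lemma 11). It uses $\sum_{\alpha \text{ odd}} \ell^{-\alpha} = \ell/(\ell^2-1)$ and $\sum_{\alpha \ge 2 \text{ even}} \ell^{-\alpha} = 1/(\ell^2-1)$. Function names are illustrative.

```python
from fractions import Fraction

def odd_sum(l: int) -> Fraction:
    """Sum of l^(-alpha) over odd alpha >= 1."""
    return Fraction(l, l * l - 1)

def even_sum(l: int) -> Fraction:
    """Sum of l^(-alpha) over even alpha >= 2."""
    return Fraction(1, l * l - 1)

def A(l: int) -> Fraction:
    # a_r vanishes for odd alpha and equals (l - 1) / l^(alpha + 1) for even alpha
    return 1 + Fraction(l - 1, l) * even_sum(l)

def B1(l: int) -> Fraction:
    # case l does not divide r(r-1)(r-2)
    return 1 - Fraction(2, l - 2) * odd_sum(l) + Fraction(l - 3, l - 2) * even_sum(l)

def B2(l: int) -> Fraction:
    # case l divides r - 1
    return 1 - Fraction(1, l - 1) * odd_sum(l) + Fraction(l - 2, l - 1) * even_sum(l)

def B3(l: int) -> Fraction:
    # case l divides r(r - 2)
    return 1 - Fraction(1, l - 2) * odd_sum(l) + even_sum(l)

if __name__ == "__main__":
    for l in (3, 5, 7, 11, 13):
        print(l, A(l), B1(l), B2(l), B3(l))
```

Comparing these with the closed forms in the lemma for a handful of odd primes is a quick way to catch transcription slips in the numerators.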

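We extend the definitions of $A(\ell)$, $B^{(1)}(\ell)$, $B^{(2)}(\ell)$, $B^{(3)}(\ell)$, $C^{(1)}(\ell)$, $C^{(2)}(\ell)$ introduced in Lemma
12 to any odd prime $\ell$ (independently of the relation between $\ell$ and $r$). Then we rewrite $D_r$ as

\begin{align*}
D_r &= \Biggl( \prod_{\ell} \sum_{\alpha \ge 0} b_r(\ell^\alpha) \Biggr) \Biggl( \prod_{\substack{\ell \mid r-1 \\ \ell \ne 2}} \frac{\ell-1}{\ell-2} \Biggr) \Biggl( \prod_{\ell \nmid 2r(r-2)} \sum_{\alpha \ge 0} c_r(\ell^\alpha) \Biggr) \\
&= B(2) \Biggl( \prod_{\ell \ne 2} B^{(1)}(\ell) C^{(1)}(\ell) \Biggr) \Biggl( \prod_{\substack{\ell \mid r(r-2) \\ \ell \ne 2}} \frac{B^{(3)}(\ell)}{B^{(1)}(\ell) C^{(1)}(\ell)} \Biggr) \Biggl( \prod_{\substack{\ell \mid r-1 \\ \ell \ne 2}} \frac{\ell-1}{\ell-2} \cdot \frac{B^{(2)}(\ell) C^{(2)}(\ell)}{B^{(1)}(\ell) C^{(1)}(\ell)} \Biggr).
\end{align*}

Using Lemma 12, we compute

\begin{align*}
B^{(1)}(\ell) C^{(1)}(\ell) &= \frac{\ell^3 - 2\ell^2 - 2\ell}{\ell^3 - 2\ell^2 - \ell + 2} = \frac{\ell(\ell^2 - 2\ell - 2)}{(\ell-2)(\ell^2-1)}, \\
\frac{B^{(3)}(\ell)}{B^{(1)}(\ell) C^{(1)}(\ell)} &= \frac{\ell^2 - 2\ell - 1}{\ell^2 - 2\ell - 2} = 1 + \frac{1}{\ell^2 - 2\ell - 2}, \\
\frac{\ell-1}{\ell-2} \cdot \frac{B^{(2)}(\ell) C^{(2)}(\ell)}{B^{(1)}(\ell) C^{(1)}(\ell)} &= \frac{\ell^2 - \ell - 1}{\ell^2 - 2\ell - 2} = 1 + \frac{\ell+1}{\ell^2 - 2\ell - 2}.
\end{align*}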
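These three local identities, together with the twin-prime factor $\ell(\ell-2)/(\ell-1)^2$ that enters $C_r$, are easy to confirm with exact fractions. The following is a small sketch that takes the closed forms of Lemma 12 as its starting point (as reconstructed from the surrounding computations). The helper names are illustrative.

```python
from fractions import Fraction

def B1(l): return Fraction(l**3 - 2 * l**2 - 2 * l - 1, (l - 2) * (l * l - 1))
def B2(l): return Fraction(l**3 - l**2 - l - 1, (l - 1) ** 2 * (l + 1))
def B3(l): return Fraction(l * (l * l - 2 * l - 1), (l - 2) * (l * l - 1))
def C1(l): return 1 + Fraction(1, l**3 - 2 * l**2 - 2 * l - 1)
def C2(l): return 1 + Fraction(1, l**3 - l**2 - l - 1)

def generic_factor(l):
    """B1 * C1, the Euler factor at a prime not dividing r(r-1)(r-2)."""
    return B1(l) * C1(l)

def correction_r_r_minus_2(l):
    """Extra factor at odd primes dividing r(r-2)."""
    return B3(l) / generic_factor(l)

def correction_r_minus_1(l):
    """Extra factor at odd primes dividing r-1, including (l-1)/(l-2)."""
    return Fraction(l - 1, l - 2) * B2(l) * C2(l) / generic_factor(l)

if __name__ == "__main__":
    for l in (3, 5, 7, 11):
        d = l * l - 2 * l - 2
        print(l,
              generic_factor(l) == Fraction(l * d, (l - 2) * (l * l - 1)),
              correction_r_r_minus_2(l) == 1 + Fraction(1, d),
              correction_r_minus_1(l) == 1 + Fraction(l + 1, d))
```

Multiplying `generic_factor(l)` by $\ell(\ell-2)/(\ell-1)^2$ gives $\ell^2(\ell^2-2\ell-2)/((\ell-1)^3(\ell+1))$, the generic Euler factor of $C_r$.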
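Finally, replacing all the above in (35), we obtain

\begin{align*}
C_r &= \frac{4}{3} \Biggl( \prod_{\ell \ne 2} \frac{\ell(\ell-2)}{(\ell-1)^2} \cdot \frac{\ell(\ell^2 - 2\ell - 2)}{(\ell-2)(\ell^2-1)} \Biggr) \prod_{\substack{\ell \mid r-1 \\ \ell \ne 2}} \left(1 + \frac{\ell+1}{\ell^2 - 2\ell - 2}\right) \prod_{\substack{\ell \mid r(r-2) \\ \ell \ne 2}} \left(1 + \frac{1}{\ell^2 - 2\ell - 2}\right) \\
&= \frac{4}{3} \prod_{\ell \ne 2} \frac{\ell^2(\ell^2 - 2\ell - 2)}{(\ell-1)^3(\ell+1)} \prod_{\substack{\ell \mid r-1 \\ \ell \ne 2}} \left(1 + \frac{\ell+1}{\ell^2 - 2\ell - 2}\right) \prod_{\substack{\ell \mid r(r-2) \\ \ell \ne 2}} \left(1 + \frac{1}{\ell^2 - 2\ell - 2}\right).
\end{align*}

This completes the proof of Proposition 10. □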



6.3 The average constant 



Using Proposition 10, the main term (26) of (25) is $Y \sum_{|r| \le R,\ r \ne 1 \text{ odd}} C_r$; thus now we need to
average the constant $C_r$ over $r$. This calculation has similarities with the one done by Gallagher
in [Ga] for the average of the standard twin prime constant. As such, we will follow the notation
used in [Ga].

The main result of this section is:



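Lemma 13 As $R \to \infty$,

\[ \sum_{\substack{|r| \le R \\ r \ne 1 \text{ odd}}} C_r = \mathfrak{C} R + O\left(\log^2 R\right). \]

Proof. Let us write

\begin{equation}
C_r = \frac{4}{3} \prod_{\ell \ne 2} \frac{\ell^2(\ell^2 - 2\ell - 2)}{(\ell-1)^3(\ell+1)} \prod_{\substack{\ell \ne 2 \\ \ell \mid r(r-1)(r-2)}} \bigl(1 + e_r(\ell)\bigr), \tag{36}
\end{equation}

where

\[ e_r(\ell) := \begin{cases} e^{(1)}(\ell) & \text{if } \ell \mid r-1, \\ e^{(2)}(\ell) & \text{if } \ell \mid r(r-2), \\ 0 & \text{otherwise} \end{cases} \;=\; \begin{cases} \dfrac{\ell+1}{\ell^2 - 2\ell - 2} & \text{if } \ell \mid r-1, \\[1.5ex] \dfrac{1}{\ell^2 - 2\ell - 2} & \text{if } \ell \mid r(r-2), \\[1.5ex] 0 & \text{otherwise.} \end{cases} \]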



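Let us also fix the following notation: for $r \ne 1$ odd, we take

\begin{align*}
\mathcal{P}(r) &:= \{\ell \text{ odd prime} : \ell \mid r(r-1)(r-2)\}, \\
\mathcal{D}(R) &:= \Bigl\{q \text{ positive square-free integer} : \ell \mid q \Rightarrow \ell \in \bigcup_{\substack{|r| \le R \\ r \ne 1 \text{ odd}}} \mathcal{P}(r)\Bigr\}.
\end{align*}

We want to evaluate

\begin{equation}
S := \sum_{\substack{|r| \le R \\ r \ne 1 \text{ odd}}} \prod_{\substack{\ell \ne 2 \\ \ell \mid r(r-1)(r-2)}} \bigl(1 + e_r(\ell)\bigr) = \sum_{\substack{|r| \le R \\ r \ne 1 \text{ odd}}} \sum_{q \in \mathcal{D}(R)} e_r(q), \tag{37}
\end{equation}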
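where $e_r(1) = 1$ and, for $q \ne 1$, $e_r(q) = \prod_{\ell \mid q} e_r(\ell)$. We write

\begin{align*}
S &= \sum_{q \in \mathcal{D}(R)} \sum_{\substack{|r| \le R \\ r \ne 1 \text{ odd}}} e_r(q) = \sum_{q \in \mathcal{D}(R)} \sum_{\substack{\text{all possible} \\ e = e(q)}} \sum_{\substack{|r| \le R \\ r \ne 1 \text{ odd} \\ e_r(q) = e}} e \\
&= \sum_{q \in \mathcal{D}(R)} \sum_{\substack{\text{all possible} \\ e = e(q)}} e \cdot \#\{|r| \le R : r \ne 1 \text{ odd},\ e_r(q) = e\} \\
&= \sum_{q \in \mathcal{D}(R)} \sum_{v = v(q)} \prod_{\ell \mid q} e^{(v(\ell))}(\ell)\, N(q,v),
\end{align*}

where the sum $\sum_{v = v(q)}$ is over all maps $v : \{\ell : \ell \mid q\} \to \{1, 2\}$ and where

\[ N(q,v) := \#\{|r| \le R : r \ne 1 \text{ odd},\ e_r(\ell) = e^{(v(\ell))}(\ell)\ \ \forall \ell \mid q\}. \]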

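By looking at the conditions imposed on $\ell$ when defining $e^{(1)}(\ell)$ and $e^{(2)}(\ell)$, we see that
$N(q,v)$ is the number of integers $|r| \le R$ with $r \ne 1$ odd such that

\begin{align*}
r &\equiv 1 \pmod{2}, \\
r &\equiv 1 \pmod{\ell} \quad \forall \ell \mid q \text{ with } v(\ell) = 1, \\
r &\equiv 0 \text{ or } 2 \pmod{\ell} \quad \forall \ell \mid q \text{ with } v(\ell) = 2.
\end{align*}

Therefore, by using the Chinese Remainder Theorem, $r$ as above lies in one of $\prod_{\ell \mid q} 2^{v(\ell)-1}$ distinct
residue classes modulo $2q$. Consequently,

\[ N(q,v) = \frac{R}{q} \prod_{\ell \mid q} 2^{v(\ell)-1} + O\bigl(2^{\omega(q)}\bigr), \]

where $\omega(q)$ denotes the number of distinct prime factors of $q$. We plug this in the formula for
$S$ and obtain

\begin{align*}
S &= R \sum_{q \in \mathcal{D}(R)} \frac{1}{q} \sum_{v = v(q)} \prod_{\ell \mid q} e^{(v(\ell))}(\ell)\, 2^{v(\ell)-1} + O\Biggl( \sum_{q \in \mathcal{D}(R)} 2^{\omega(q)} \sum_{v = v(q)} \prod_{\ell \mid q} e^{(v(\ell))}(\ell) \Biggr) \\
&=: S_{\mathrm{main}} + S_{\mathrm{error}}.
\end{align*}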
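The residue-class count used for $N(q,v)$ can be checked by brute force on small moduli. The sketch below counts odd residues $r$ modulo $2q$ satisfying the congruences attached to a map $v$ and compares the count with $\prod_{\ell \mid q} 2^{v(\ell)-1}$. Representing $v$ as a dictionary and the name `count_classes` are choices made here for illustration.

```python
def count_classes(v: dict) -> int:
    """Count odd residues r modulo 2q (q = product of the keys of v) with
    r = 1 (mod l) when v[l] == 1 and r = 0 or 2 (mod l) when v[l] == 2."""
    q = 1
    for l in v:
        q *= l
    count = 0
    for r in range(2 * q):
        if r % 2 != 1:
            continue
        if all(((r - 1) % l == 0) if t == 1 else (r % l in (0, 2))
               for l, t in v.items()):
            count += 1
    return count

def predicted(v: dict) -> int:
    """The count given by the Chinese Remainder Theorem: product of 2^(v(l) - 1)."""
    out = 1
    for t in v.values():
        out *= 2 ** (t - 1)
    return out

if __name__ == "__main__":
    for v in ({3: 1, 5: 2, 7: 2}, {3: 2, 5: 1}, {3: 2, 5: 2, 7: 2, 11: 1}):
        print(v, count_classes(v), predicted(v))
```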

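To estimate $S_{\mathrm{main}}$, we observe that we have

\[ S_{\mathrm{main}} = R \sum_{q \in \mathcal{D}(R)} G(q) \]

for some multiplicative function $G(q)$. Therefore

\begin{align*}
S_{\mathrm{main}} &= R \prod_{\substack{\ell \le R \\ \ell \ne 2}} \left(1 + \frac{e^{(1)}(\ell) + 2e^{(2)}(\ell)}{\ell}\right) = R \prod_{\substack{\ell \le R \\ \ell \ne 2}} \frac{\ell^3 - 2\ell^2 - \ell + 3}{\ell(\ell^2 - 2\ell - 2)} \\
&= R \prod_{\ell \ne 2} \frac{\ell^3 - 2\ell^2 - \ell + 3}{\ell(\ell^2 - 2\ell - 2)} + O(1),
\end{align*}

where the last step uses that the factors with $\ell > R$ are $1 + O(\ell^{-2})$.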
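Now let us estimate $S_{\mathrm{error}}$. As for $S_{\mathrm{main}}$, we observe that we have

\[ S_{\mathrm{error}} = \sum_{q \in \mathcal{D}(R)} F(q) \]

for some multiplicative function $F(q)$. Therefore

\begin{align*}
S_{\mathrm{error}} &= O\Biggl( \prod_{\substack{\ell \le R \\ \ell \ne 2}} \Bigl[1 + 2\bigl(e^{(1)}(\ell) + e^{(2)}(\ell)\bigr)\Bigr] \Biggr) \\
&= O\Biggl( \prod_{\substack{\ell \le R \\ \ell \ne 2}} \left(1 + \frac{2(\ell+2)}{\ell^2 - 2\ell - 2}\right) \Biggr) = O\bigl((\log R)^2\bigr).
\end{align*}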

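We put the two estimates together and obtain

\[ S = R \prod_{\ell \ne 2} \frac{\ell^3 - 2\ell^2 - \ell + 3}{\ell(\ell^2 - 2\ell - 2)} + O\bigl((\log R)^2\bigr). \]

By replacing (37) in (36), this gives

\begin{align*}
\sum_{\substack{|r| \le R \\ r \ne 1 \text{ odd}}} C_r &= \frac{4R}{3} \prod_{\ell \ne 2} \frac{\ell^2(\ell^2 - 2\ell - 2)}{(\ell-1)^3(\ell+1)} \cdot \frac{\ell^3 - 2\ell^2 - \ell + 3}{\ell(\ell^2 - 2\ell - 2)} + O\left(\log^2 R\right) \\
&= \frac{4R}{3} \prod_{\ell \ne 2} \frac{\ell^4 - 2\ell^3 - \ell^2 + 3\ell}{(\ell-1)^3(\ell+1)} + O\left(\log^2 R\right),
\end{align*}

which completes the proof of Lemma 13. □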
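The local computation at the heart of this proof can be verified exactly on a full period. In the sketch below, the average of $\prod_{\ell}(1 + e_r(\ell))$ over all odd residues $r$ modulo $2q$, with $q$ a product of a few odd primes, is compared with $\prod_{\ell} (\ell^3 - 2\ell^2 - \ell + 3)/(\ell(\ell^2 - 2\ell - 2))$. The values of $e^{(1)}$ and $e^{(2)}$ are the ones read off from the proof, the excluded value $r = 1$ is not treated specially here, and all function names are illustrative.

```python
from fractions import Fraction

def e1(l: int) -> Fraction:
    """e^(1)(l), used when l divides r - 1."""
    return Fraction(l + 1, l * l - 2 * l - 2)

def e2(l: int) -> Fraction:
    """e^(2)(l), used when l divides r(r - 2)."""
    return Fraction(1, l * l - 2 * l - 2)

def local_weight(r: int, l: int) -> Fraction:
    """The factor 1 + e_r(l) for an odd prime l."""
    if (r - 1) % l == 0:
        return 1 + e1(l)
    if r % l == 0 or (r - 2) % l == 0:
        return 1 + e2(l)
    return Fraction(1)

def average_over_period(primes) -> Fraction:
    """Average of prod_l (1 + e_r(l)) over the odd residues r modulo 2q."""
    q = 1
    for l in primes:
        q *= l
    total = Fraction(0)
    for r in range(1, 2 * q, 2):      # exactly q odd residues, one per class mod q
        w = Fraction(1)
        for l in primes:
            w *= local_weight(r, l)
        total += w
    return total / q

def predicted(primes) -> Fraction:
    """Product of the local densities appearing in S_main."""
    out = Fraction(1)
    for l in primes:
        out *= Fraction(l**3 - 2 * l**2 - l + 3, l * (l * l - 2 * l - 2))
    return out

if __name__ == "__main__":
    for primes in ([3], [3, 5], [3, 5, 7]):
        print(primes, average_over_period(primes), predicted(primes))
```

Multiplying each local density by $\ell^2(\ell^2 - 2\ell - 2)/((\ell-1)^3(\ell+1))$ gives the factor $(\ell^4 - 2\ell^3 - \ell^2 + 3\ell)/((\ell-1)^3(\ell+1))$ of the final product.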

Replacing Proposition 10 and Lemma 13 in (26), this concludes the proof of Proposition 8.